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PROTEINS, POLYNUCLEOTIDES ENCODING THEM AND METHODS 

OF USING THE SAME 

FIELD OF THE INVENTION 

5 The invention relates to polynucleotides anq 1 the polypeptides encoded by such 

polynucleotides, as well as vectors, host cells, antibodies and recombinant methods for 

producing the polypeptides and jDolynucleotides, as well as methods for using the same. 

i i 

BACKGROUND OF THE INVENTION 

The present invention is based in pail on nucleic acids encoding proteins that are new 
10 members of the following protein families: TEN~M4-like, Semphorin-like, Erythroid 

membrane associated-like, Vitelline membrane outer layer 1 precursor-like, MAST205-like, 
Kilon-like, Mixed lineage kinase 2-like, S-l like, Guanine Nucleotide Releasing-like, 
hated eukin-1 like, Interleukin-1 signal transducer-like, GPCR-like, Glucuronosyl transferase- 
like, Prostasin-like, LDLR-like, TNFR-like, TRAF5-Iike, Ferritin light chain-like, 
15 Neurotrophin-6 alpha-like and Methionyl Aminopeptidase-like. More particularly, the 

invention relates to nucleic acids encoding novel polypeptides, as well as vectors, host cells, 
antibodies, and recombinant methods for producing these nucleic acids and polypeptides. 

SUMMARY OF THE INVENTION 

The invention is based in part upon the discovery of nucleic acid sequences encoding 
20 novel polypeptides. The novel nucleic acids and polypeptides are referred to herein as NOVX, 
or NOV1, NOV2, NOV3, NOV4, NOV5, NOV6, NOV7, NOV8, NOV9, NOV10, NOV 1 1 . 
NOV12, NOV13, NOV14, NOV15, NOV16, NOV17, NOV1 8 and NOV19. nucleic acids and 
polypeptides. These nucleic acids and polypeptides, as well as derivatives, homologs, analogs 
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and fragments thereof, will hereinafter be collectively designated as "NOVX" nucleic acid or 
polypeptide sequences. 

In one aspect,' the 'invention provides an isolated NOVX nucleic acid molecule 
encoding a NbVX polypeptide that includes a nucleic acid sequence that has identity to the 
5 nucleic acids disclosed in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 34, 35, 37, 39, 41, 43, 45, 47 and 49. In some embodiments, the NOVX nucleic acid 
molecule will hybridize under stringent cdnditions to a' nucleic acid sequence complementary 
to a nucleic acid molecule that includes a protein-coding sequence of a NOVX nucleic acid 
sequence. The invention also includes an isolated nucleic acid that encodes a NOVX 

1 0 polypeptide, or a fragment, bomolog, analog or derivative thereof. For example, the nucleic 
acid can encode a polypeptide at least 80% identical to a polypeptide comprising the amino 
acid sequences of SEQ IDNOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 21, 24, 26, 28, 30, 32, 36, 38, 
40, 42, 44, 46, 48 and 50. The nupleic acid can be, for example, a genomic DNA fragment or 
a cDNA molecule that includes the nucleic acid sequence of any "of SEQ ID NOS:l, 3, 5, 7, 9, 

15 11, 13, 15, 17, 19, 21,23,25,27,29, 31,33, 34, 35, 37, 39, 41, 43, 45, 47 and 49. 

Also included in the invention is an oligonucleotide, e.g., an oligonucleotide which 
includes at least 6 contiguous nucleotides of a NOVX nucleic acid (e.g., SEQ ID NOS:l, 3, 5, 
7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45,'47 and 49) or a 
complement of said oligonucleotide. Also included in the invention are substantially purified 

20 NOVX polypeptides (SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 
38, 40, 42, 44, 46, 48 and 50). In certain embodiments, the NOVX polypeptides include an 
amino acid sequence that is substantially identical to the amino acid sequence of a human 
NOVX polypeptide. 

The invention also features antibodies that immunoselectively bind to NOVX 

25 polypeptides, or fragments, homologs, analogs or derivatives thereof. 

In another aspect, the invention includes pharmaceutical compositions that include 
therapeutically- or prophylactically-effective amounts of a therapeutic and a phannaceutically- 
acceptable carrier. The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, 
or an antibody specific for a NOVX polypeptide. In a further aspect, the invention includes, in 

30 one or more containers, a therapeutically- or prophylactically-effective amount of this 
pharmaceutical composition. 

In a further aspect, the invention includes a method of producing a polypeptide by 
culturing a cell that includes a NOVX nucleic acid, under conditions allowing for expression 

2 
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of the NOVX polypeptide encoded by the DNA. If desired, the NOVX polypeptide can then 
be recovered. 

In another aspect,'ihe indention includes a method of detecting the'plresence of a 
NOVX polypeptide in a sample. In the method, a sample is contacted with a compound that 
selectively binds to the polypeptide under conditions allowing for formation of a complex 
between the polypeptide and the compound. The complex is detected, if present, thereby 
identifying the NOVX polypeptide within the sample. 1 ' ' , 

The invention also includes methods to identify specific cell or tissue types based on 
their expression of a NOVX. 

Also included in the invention is a method of detecting the presence of a NOVX 
nucleic acid molecule in a sample by contacting the sample with a NOVX nucleic acid probe 
or primer, and detecting whether the nucleic acid probe or primer bound to a NOVX nucleic 
acid molecule in the sample. , , 

In a further aspect, the invention provides a method for modulating th£ activity of a 
NOVX polypeptide by contacting a cell sample that includes the NOVX polypeptide with a 
compound that binds to the NOVX polypeptide in an amount sufficient to modulate the 
activity of said polypeptide. The compound can be, e.g., a small molecule, such as a nucleic 
acid, peptide, polypeptide, peptidomimetic, carbohydrate, lipid or other organic (carbon 
containing) or inorganic molecule, as further described herein. 

Also within the scope of the invention is the use of a therapeutic in the manufacture of 
a medicament for treating or preventing disorders or syndromes including, e.g., trauma, 
regeneration (in vitro and in vivo), viral/bacterial/parasitic infections, Von Hippel-Lindau 
(VHL) syndrome, Alzheimer's disease, stroke, Tuberous sclerosis, hypercalcemia, Parkinson's 
disease, Huntington's disease, Cerebral palsy, Epilepsy, Lesch-Nyhan syndrome, Multiple 
sclerosis, Ataxia-telangiectasia, Leukodystrophies, behavioral disorders, addiction, anxiety, 
pain, actinic keratosis, acne, hair growth diseases, allopecia, pigmentation disorders, endocrine 
disorders, connective tissue disorders, such as severe neonatal Marfan syndrome, dominant 
ectopia lentis, familial ascending aortic aneurysm, isolated skeletal features of Marfan 
syndrome, Shprintzen-Goldberg syndrome, genodennatoses, contractural arachnodactyly, 
inflammatory disorders such as osteo- and rheum atoid-arthritis, inflammatory bowel disease, 
Crohn's disease; immunological disorders, AIDS; cancers including but not limited to lung 
cancer, colon cancer, Neoplasm; adenocarcinoma; lymphoma; prostate cancer; uterus cancer, 
leukemia or pancreatic cancer; blood disorders; asthma; psoriasis; vascular disorders, 
hypertension, skin disorders, renal disorders including Alport syndrome, immunological 
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disorders, tissue injury, fibrosis disorders, bone diseases, Ehlers-Danlos syndrome type VI, 
" VII, type IV, S-3inked cutis laxa and Ehlers-Danlos syndrome type V, osteogenesis imperfecta, 
Neurologic diseases, Brain and/or autoimmune disorders like encephalomyelitis, 
neurodegenerative disorders, immune disorders, hematopoietic disorders, muscle disorders, 
5 inflammation and wound repair, bacterial, fungal, protozoal and viral infections (particularly 
infections caused by HFV-1 or HIV-2), pain, acute heart failure, hypotension, hypertension, 
urinary retention, osteoporosis, Treatment of Albright Hereditary Osteodystrophy, angina 
pectoris, myocardial infarction, ulcers, benign prostatic hypertrophy, arthrogryposis multiplex 
congenita, osteogenesis imperfecta, keratoconus, scoliosis, duodenal atresia, esophageal 
1 0 'atresia, intestinal malrotation, Pancreatitis, Obesity Systemic lupus erythematosus, 

Autoimmune disease, Emphysema, Scleroderma, allergy, ARDS, Neuroprotection, Fertility 
■ Myasthenia gravis, Diabetes, obesity, Growth and reproductive disorders Hemophilia, 
Hypercoagulatiom Idiopathic thrombocytopenic purpura , Immunodeficiencies, Graft vesus 
host, Adrenoleukodystrophy , Congenital Adrenal Hyperplasia, Endometriosis, Xerostomia, 
1 5 Ulcers, Cirrhosis, Transplantation, Diverticular disease, Hirschsprung's disease, Appendicitis, 
Arthritis, Ankylosing spondylitis, Tendinitis, Renal artery stenosis, Interstitial nephritis, 
Glomerulonephritis, Polycystic kidney disease, erythematosus, Renal tubular acidosis, IgA 
nephropathy, anorexia, bulimia, psychotic disorders, including anxiety, schizophrenia, manic 
depression, delirium, dementia, severe mental retardation and dyskinesias, such as 
20 Huntington's disease and/or other pathologies and disorders of the like. 

The therapeutic can be, e.g., a NOVX nucleic acid, a NOVX polypeptide, or a NOVX- 
specific antibody, or biologically-active derivatives or fragments thereof. 

For example, the compositions of the present invention will have efficacy for treatment 
of patients suffering from the diseases and disorders disclosed above and/or other pathologies 
25 and disorders of the like. The polypeptides can be used as immunogens to produce antibodies 
specific for the invention, and as vaccines. They can also be used to screen for potential 
agonist and antagonist compounds. For example, a cDNA encoding NOVX may be useful in 
gene therapy, and NOVX may be useful when administered to a subject in need thereof By 
way of non-limiting example, the compositions of the present invention will have efficacy for 
30 treatment of patients suffering from the diseases and disorders disclosed above and/or other 
pathologies and disorders of the like. 

The invention further includes a method for screening for a modulator of disorders or 
syndromes including, e.g., the diseases and disorders disclosed above and/or other pathologies 
and disorders of the like. The method includes contacting a test compound with a NOVX 
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polypeptide and determining if the test compound binds to said NOVX polypeptide. Binding 

1 1 "* " ■ . 1 • 

of the test compound to the NOVX polypeptide indicates the test compound is a modulator of 

activity, or of latency or predisposition to the aforementioned disorders or syndromes. 

Also within the scope of the invention is a method for screening for a modulator of 

5 activity, or of latency or predisposition to disorders or syndromes including, e.g., the diseases 

and disorders disclosed above and/or other pathologies and disorders of the like by 

administering a test compound td a test animal at increased risk for the aforementioned : 

disorders or syndromes. The test animal expresses a recombinant polypeptide encoded by a 

NOVX nucleic acid. Expression or activity of NOVX polypeptide is then measured in the test 

10 animal, as is expression or activity of the protein in a control animal which recombinantly- 

expresses NOVX polypeptide and is not at increased risk for the disorder or syndrome. Next, 
the expression of NOVX polypeptide in both the test animal and the control animal is 
compared. A change in the activity of NOVX polypeptide in the test animal relative to the 
control animal indicates the test compound is a modulator of latency of the disorder or 

15 syndrome. 

In yet another aspect, the invention includes a method for determining the presence of 
or predisposition to a disease associated with altered levels of a NOVX polypeptide, a NOVX 
nucleic acid, or both, in a subject (e.g., a humah subject). The method includes measuring the 
amount of the NOVX polypeptide in a test sample from the subject and comparing the amount 

20 of the polypeptide in the test sample to the amount of the NOVX polypeptide present in a 
control sample. An alteration in the level of the NOVX polypeptide in the test sample as 
compared to the control sample indicates the presence of or predisposition to a disease in the 
subject. Preferably, the predisposition includes, e.g., the diseases and disorders disclosed 
aboye and/or other pathologies and disorders of the like. Also, the expression levels of the new 

25 polypeptides of the invention can be used in a method to screen for various cancers as well as 
to determine the stage of cancers. 

Ln a further aspect, the invention includes a method of treating or preventing a 
pathological condition associated with a disorder in a mammal by administering to the subject 
a NOVX polypeptide, a NOVX nucleic acid, or a NOVX-specific antibody to a subject (e.g., a 

30 human subject), in an amount sufficient to alleviate or prevent the pathological condition. In 
preferred embodiments, the disorder, includes, e.g., the diseases and disorders disclosed above 
and/or other pathologies and disorders of the like. 

hi yet another aspect, the invention can be used in a method to identity the cellular 
receptors and downstream effectors of the invention by any one of a number of techniques 

5 
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commonly employed in the art. These include but are not limited to the two-hybrid system, 
" affinity purification, co-precipitation with antibodies or other specific-interacting molecules. , 
NOVX nucleic acids and polypeptides are further useful in the generation of antibodies 
that bind immuno-specifically to the novel NOVX substances, for use in therapeutic or 
5 diagnostic methods. These NOVX antibodies may be generated according to methods known 
in the art, using prediction from hydrophobicity charts, as described in the "Anti-NOVX 
Antibodies" section below. The disclosed NOVX proteins have multiple hydrophilic regions, 
each of which can be used as an immunogen. These NOVX proteins can be used .in assay 
systems for functional analysis Of various human disorders, which will help in understanding 
10 'of,pathology of the disease and development of new drug targets for various disorders. 

The NOVX nucleic acids and proteins identified here may be useful in potential 
• therapeutic applications implicated in (but not limited to) various pathologies and disorders as 
indicated below. The potential therapeutic applications for this invention include, but are not 
limited to: protein therapeutic, small molecule drug target, antibody target (therapeutic, 
15 diagnostic, drug targeting/cytotoxic antibody), diagnostic and/or prognostic marker, gene 

therapy (gene delivery/gene ablation), research tools, tissue regeneration in vivo and in vitro of 
all tissues and cell types composing (but not limited to) those defined here. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
20 belongs. Although methods and materials similar or equivalent to those described herein can 
be used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference in their entirety. In the case of conflict, the 
present specification, including definitions, will control. In addition, the materials, methods, 
25 and examples are illustrative only and not intended to be limiting. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides novel nucleotides and polypeptides encoded thereby. 
30 Included in the invention are the novel nucleic acid sequences and their encoded polypeptides. 
The sequences are collectively referred to herein as "NOVX nucleic acids" or "NOVX 
polynucleotides" and the corresponding encoded polypeptides are referred to as "NOVX 
polypeptides" or "NOVX proteins." Unless indicated otherwise, "NOVX" is meant to refer to 
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any of the novel sequences disclosed herein. Table A provides a summary of the NOVX 
nucleic acids and their encoded polypeptides. 

TABLE A. Sequences and Corresponding SEQ ID Numbers 



NOVX 
Assignment 


Internal Identification 


SEQ ID 

NO 
(nucleic 

acid) 


SEQ ID NO 
(polypeptide) 


Homology 


1 


20422974 0 132 dal 


1 


2 


TEN-M4 like 


2 


CG56091-01 


3 


4 


Semphorin like 


3 


CG50351 01 


5 


6 


Erythroid membrane associated like 


4a 


133268995„dal, 


7 


8 


Vitelline membrane outer layer I 
precursor like 


4b 


CG56375-04 


9 


10 


Vitelline membrane outer layer 1 1 
precursor like 


5 


CG56089-01 


11 


12 


MAST205 like 


6 


CG56087-0J 


13 


14 


Kilon like 


7 


CG56071-01 


15 


16 


Mixed lineage kinase 2 like 


8 


CG56042-01 


17 


18 


S-l like 


9 


CG55997-02 


19 


20 


Guariine Nucleotide Releasing like 


10 


CG56083-01 


21 


22 


Interleukin-1 like 


11 


CG56093-01 


23 


24 I 


Interleukin-l signal transducer-like 


12 


CG56138-01 


25 


26 


GPCR like 


13 


CG56097-01 


27 


28 


Glucuronosyl transferase like 


14a 


CG56123-01 


29 


30 


Prostasin like t 


14b 


CG56123-02 


31 


32 


Prostasin like 




162262711 


33 




Prostasin like 




162262716 


34 




Prostasin like 


15 


CG50153-01/ 
AC025263 da2 


35 


36 


LDLR like 


16a 


CG561 08-01 


37 


38 


TNFR like 


16b 


CG561 08-02 


49 


50 


TRAF5 like 


17 


CG56101-01 


39 


40 


Ferritin light chain like 


18 


CG56095-03 


41 


42 


Neurotrophin-6 alpha like 


19a 


CG50287-02 


43 


44 


Methionyl Aminopeptidase like 


19b 


CG502S7-01 


45 


46 


Methionyl Aminopeptidase like 


19c 


CG50287-03 


47 


48 


Methionyl Aminopeptidase like 



5 NOVX nucleic acids and their encoded polypeptides are useful in a variety of 

applications and contexts. The various NOVX nucleic acids and polypeptides according to the 
invention are useful as novel members of the protein families according to the presence of 
domains and sequence relatedness to previously described proteins. Additionally, NOVX 
nucleic acids and polypeptides can also be used to identify proteins that are members of the 
1 0 family to which the NOVX polypeptides belong. 

The NOVX nucleic acids and polypeptides can also be used to screen for molecules, 
which inhibit or enhance NOVX activity or function. Specifically, the nucleic acids and 
polypeptides according to the invention maybe used as targets for the identification of small 
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molecules that modulate or inhibit, e.g., neurogenesis, cell differentiation, cell proliferation, 
hematopoiesis, wound healing and angiogenesis. 

Additional utilities for the NOVX nuxdeic acids and polypeptides according to the 
invention are disclosed herein. 



5 NOV1 

A disclosed NOV1 nucleic acid of 8438 nucleotides (also referred to as CG56O91 T 01) 
encoding a novel TEN-M4-like protein is shown in Table 1 A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 4-6 and ending with a TGA 
codon at nucleotides S436-8438. A putative untranslated region upstream from the intiation 
10 codon is underlined in Table 1A, and the start and stop codons are in bold letters. 



Table 1A. NO VI nucleotide sequence (SEQ ID NO:l). 

GCCATGGACQTGAAGGAGAGGAAGCCTTACCGCTCGCTGACCCGGCGCCGCGACGCCGAGCGCCGCTACACCAGCTCGTCC" 

gcggacagcgaggagggcaaagcccggcAgaaatcgtacagctccagcgagaccctgaaggcctacgaccaggacgcccgc 

CTAGCCT ATGG CAGC CGCGTCAAGG ACATTG TGCCGCAGG AGGCCG AGG AATTCTGCCGC AC AGGTGC C AACTT CACCCTG 
CGGGAGCTGGGGCTGGAAGAAGTAACGCCCCCTCACGGGACCCTGTACCGGACAGACATTGGCCTCCCCCACTGCGGCTAC 
TCCATGGGGGCTGGCTCTGATGCCGACATGGAGGCTGACACGGTGCTGTCCCCTGAGCACCCCGTGCGTCTGTGGGGCCGG 
AGCACACGGTCAGGGCGCAGCTCCTGCCTGTCCAGCOGGGCCAATTCCAATCTCACACTCACCGACACCGAGCATGAAAAC 
ACTGAGACTGATCATCCGGGCGGCCTGCAGAACCACGCGCGGCTCCGGACGCCGCCGCCGCCGCTCTCGCACGCCCACACC 
CCCAACCAGCACCACGCGGCCTCCATTAACTCCCTGAACCGGGGCAACTTCACGCCGAGGAGCAACCCCAGCCCGGCCCCC 
ACGGACCACTCGCTCTCCGGAGAGCCCCCTGCCGGCGGCGCCCAGGAGCCTGCCCACGCCCAGGAGAACTGGCTGCTCAAC 
AGCAACATCCCCCTGGAGACCAGAAACCTAGGCAAGCAGCCATTCCTAGGGACATTGCAGGACAACCTCATTGAGATGGAC 
ATTCTCGGCGCCTCCCGCCATGATGGGGCTTACAGTGACGGGCACTTCCTCTTCAAGCCTGGAGGCACCTCCCCGCTCTTC 
TGCACCACATCACCAGGGTACCCACTGACGTCCAGCACAGTGTACTCTCCTCCGCCCCGACCCCTGCCCCGCAGCACCTTC 
G CCTGG C CGG CCTTT AAC CTC AAGAAGCCCTC CAAGTACTGT AACTGGAAGTG CG CAG CCCTG AG CGCC ATCGTCATCTCA 
GCCACTCTGGTCATCCTGCTGGCATACTTTGTGGCCATGCACCTGTTTGGCCTAAACTGGCACCTGCAGCCGATGGAGGGG 
CAGATGACGGATTTATGAGATCACGGAGGACACAGCCAGCAGTTGGCCTGTGCCAACCGACGTCTCCCTATACCCCTCAGG 
GGGCACTGGCTTAGAGACCCCTGACAGGAAAGGCAAAGGAACCACAGAAGGAAAGCCCAGTAGTTTCTTTCCAGAGGCCAG 
TTTCATAGATTCTGGAGAAATTGATGTGGGAAGGCGAGCTTCCCAGAAGATTCCTCCTGGCACTTTCTGGAGATCTCAAGT 
GTTCATAGACCATCCTGTGCATCTGAAATTCAATGTGTCTCTGGGAAAGGCAGCCCTGGTTGGCATTTATGGCAGAAAAGG 
CCTCCCTCCTTCACATACACAGTTTGACTTTGTGGAGCTGCTGGATGGCAGGAGGCTCCTAACCCAGGAGGCGCGGAGCCT 
AGAGGGGACCCCGCGCCAGTCTCGGGGAACTGTGCCCCCCTCCAGCCATGAGACAGGCTTCATCCAGTATTTGGATTCAGG 
AATCTGGCACTTGGCTTTTTACAATGACGGAAAGGAGTCAGAAGTGGTTTCCTTTCTCACCACTGCCATTCTTGATTCCTG 
GGCTCTCTGTTTGGGTGATGGAGAATGCGTTTCTGGAACTTGCCATTGTTTTCCAGGATTTCTGGGTCCGGATTGTTCAAG 
AGCCGCCTGTCCAGTGTTATGTAGTGGCAACGGGCAGTACTCCAAGGGCCGCTGCCTGTGTTTCAGCGGCTGGAAGGGCAC 
CGAGTGTGATGTGCCGACTACCCAGTGTATTGACCCACAGTGTGGGGGTCGTGGGATTTGTATCATGGGCTCCTGTGCTTG 
CAACTCAGGATACAAAGGAGAAAGTTGTGAAGAAGCTGACTGTATAGACCCTGGGTGTTCTAATCATGGTGTGTGTATCCA 
CGGGGAATGTCACTGCAGTCCAGGATGGGGAGGTAGCAATTGTGAAATACTGAAGACCATGTGTCCAGACCAGTGCTCCGG 
CCACGGAACGTATCTTCAAGAAAGTGGCTCCTGCACGTGTGACCCTAACTGGACTGGCCCAGACTGCTCAAACGAAATATG 
TTCTGTGGACTGTGGCTCACACGGCGTTTGCATGGGGGGGACGTGTCGCTGTGAAGAAGGCTGGACGGGCCCAGCCTGTAA 
TCAGAGAGCCTGCCACCCCCGCTGTGCCGAGCACGGGACCTGCCGCGACGGCAAGTGCGAGTGCAGCCCTGGCTGGAATGG 
CGAACACTGCACCATCTCCCTAGCTCACTATCTGGATAGGGTAGTTAAACTTTCAGAGGGTTGCCCTGGGTTGTGCAATGG 
CAACGGCAGATGTACCTTAGACCTGAATGGTTGGCACTGCGTCTGCCAGCTGGGCTGGAGAGGAGCTGGCTGTGACACTTC 
CATGGAGACTGCCTGCGGTGACAGCAAAGACAATGATGGAGATGGCCTGGTGGACTGCATGGACCCTGACTGCTGCCTCCA 
GCCCCTGTGCCATATCAACCCGCTGTGCCTTGGCTCCCCTAACCCTCTGGACATCATCCAGGAGACACAGGTCCCTGTGTC 
ACAGCAGAACCTACACTCCTTCTATGACCGCATCAAGTTCCTCGTGGGCAGGGACAGCACGCACATAATCCCCGGGGAGAA 
CCCCTTTGATGGAGGGCATGCTTGTGTTATTCGTGGCCAAGTGATGACATCAGATGGAACCCCCCTGGTTGGTGTGAACAT 
CAGTTTTGTCAATAACCCTCTCTTTGGATATACAATCAGCAGGCAAGATGGCAGCTTTGACTTGGTGACAAATGGCGGCAT 
CTCCATCATCCTGCGGTTCGAGCGGGCACCTTTCATCACACAGGAGCACACCCTGTGGCTGCCATGGGATCGCTTCTTTGT 

catgg aaac c at c at c atg ag ac atg agg ag aatg ag attc c cagc tgtg acctg ag caattttgc ccg c cccaacc c ag t 
cgtctctccatccccactgacgtccttcgccagctcctgtgcagagaaaggccccattgtgccggaaattcaggctttgca 
ggaggaaatctctatctctggctgcaagatgaggctgagctacctgagcagccggacccctggctacaaatctgtcctgag 
gatcagcctcacccacccgaccatccccttcaacctcatgaaggtgcacctcatggtagcggtggagggccgcctcttcag 
gaagtggttcgctgcagccccagacctgtcctattatttcatttgggacaagacagacgtctacaaccagaaggtgtttgg 
gctttcagaagcctttgtttccgtgggttatgaatatgaatcctgcccagatctaatcctgtgggaaaaaagaacaacagt 
gctgcagggctatgaaattgacgcgtccaagcttggaggatggagcctagacaaacatcatgccctcaacattcaaagtgg 
tggcatcctgcacaaagggaatggggagaaccagtttgtgtctcagcagcctcctgtcattgggagcatcatgggcaatgg 
gcgccggagaagcatctcctgccccagctgcaacggccttgctgacggcaacaagctcctggccccagtggccctcacctg 
tgg ctctgacggg ag cctcta tg tggg tgatt tcaacta ca ttag aagga tcttcccctctgg aaa tgtcaccaacatcct 
agagctgagggtcagaaataaagatttcagacatagtcacagtccagcacacaaatactacctggccacagaccccatgag 
tggggccgtcttcctttctgacagcaacagccggcgggtctttaaaatcaagtccactgtggtggtgaaggaccttgtcaa 
gaactctgaggtggttgcg g ggacaggtgaccagtgcctcccctttgatga c actcgctgcggggatggtgggaaggccac 
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AGAAGCCACACTCACCAATCCCAGGGGTCCCCCAGGCATTACAGTGGACAAGTTTGGGCTGATCTACTTCGTGGATGGCAC. 

CATGATCAGACGCATCGATCAGAATGGGATCATCTCCACCCTGCTCGGCTCTAATGATCTCACATCAGCCCGGCCACTCAG 

CTGTGATTCTGTCATGGATATTTCCCAGG^AAGACAGGTTCACCTGGAGTGGCCCACAGACTTAGCCATCAACCCAATGGA 

CAACrCACTTTATGTCCTCGACAACAATGTGGTCCTGCAAATCTerGAAAACCACCAGGTGCGCATTGTCGCCGGGAGGCC' 

CATGCACTGCCAGGTCCCTGGCATTGACCACTTCCTGCTAAGCAAGGTGGCCATCCACGCAACCCTGGAGTf.CAGCCACCGC 

tttggctgtttcacacaatggggtcctgtatattgctgagactgatgagaaaaagatcaaccgcatcag'gcaggtcaccac 
tagtggag^gatctcactcgttgctggggcccccagtggctgtgactgtaaaaatgatgccaactgtgattgtttttctgg 
agacgatggttatgccaaggatgcaaagttaaataccccatcttccttggctgtgtgtgctgatggggagctctacgtggc 

CGACCTTGGGAACATCCGAATTCGGTTTATCCGGAAGAACAAGCCTTTCCTCAACACCCAGAACATGTATGAGCTGTCTTC 

ACCAATTGACCAGGAGCTCTATCTGTTTGATACCACCGGCAAGCACCTGTACACCCAAAGCCTGCCCACAGGAGACTACCT 

GTACAACTTCACCTACACTGGGGACGGCGACATCACACTCATCACAGACAACAATGGCAACATGGTAAATGTCCGCCGAGA 

CTCTACTGGGATGCCCCTCTGGCTGGTGGTCCCAGATGGCCAGGTGTACTGGGTGACCATGGGCACCAACAGTGCACTCAA 

GAGTGTGACCACACAAGGACACGAGTTGGCCATGATGACATACCATGGCAATTCbGGCCTTCTGGCAACCAAAAGGAATGA 

AAACGGATGGACAACATTTTATGAGTACGACAGCTTTGGCCGCCTGACAAATGTGACCTTCCCTACTGGCCAGGTGAGCAQ 

TTTCCGAAGTGATACAGACAGTTCAGTqCATGTCCAGGTAGAGACCTCCA'GCAAGGATGATGTCACCATAACCACCAACCT' 

GTCTGCCTCAGGCGCCTTCTACACACTGCTGCAAGACCAAGTCCGGAACAGCTACTACATCGGGGCCGATGGCTCCTTGCG 

GCTGCTGCTGGCCAACGGCATGGAGGTGGCGCTGCAGACTGAGCCCCACTTGCTGGCTGGCACCGTCAACCCCACCGTGGG 

CAAGAGGAATGTCACGCTGCCCATCGACAACGGCCTCAACCTGGTGGAGTGGCGCCAGCGCAAAGAGCAGGCTCGGGGCCA 

GGTCACTGUCTTTGGGCGCCGGCTGCGGGTGCTCCAGGTTCACAACCGAAATCTCCTATCTCTGGACTTTGATCGCGTAAC 

ACGCACAGAGAAGATCTATGATGACCACCGCAAGTTCACCCTTCGGATTCTGTACGACCAGGCGGGGCGGCCCAGCCTCTG 

GTCACCCAGCAGCAGGCTGAATGGTGTCAACGTGACATACTCCCCTGGGGGTTACATTGCTGGCATCCAGAGGGGCATCAT 

GTCTGAAAGAATGGAATACGACCAGGCGGGCCGCATCACATCCAGGATCTTCGCTGATGGGAAGACATGGAGCTACACATA 

CTTAGAGAAGGCAGGTGTCCAGTCCATGGTGCTGCTACTACACAGCCAGAGGCAGTATATCTTTGAGTTCGACAAGAATGA 

CCGCCTCTCTTCTGTGACGATGCCCAACGTGGCGCGGCAGACACTAGAGACCATCCGCTCAGTGGGCTACTACAGAAACAT 

CTATCAGCCCCCTGAGGGCAATGCCTCAGTCATACAGGACTTCACTGAGGATGGGCACCTCCTTCACACCTTCTACCTGGG 

CACTGGCCGCAGGGTGATATACAAGTATGGCAAACTGTCAAAGCTGGCAGAGACGCTCTATGACACCACCAAGGTCAGTTT 

CACCTATGACGAGACGGCAGGCATGCTGAAGACCATCAACCTACAGAATGAGGGCTTCACCTGCACCATCCGCTACCGTCA 

GATTGGGCCCCTGATTGACCGACAGATCTTCCGCTTCACTGAGGAAGGCATGGTCAACGCCCGTTTTGACTACAACTATGA 

CAACAGCTTCCGGGTGACCAGCATGCAGGCTGTGATCAACGAGACCCCACTGCCCATTGATCTCTATCGCTATGATGATGT 

GTCAGGCAAGACAGAGCAGTTTGGGAAGTTTGGTGTCATTTACTATGACATTAACCAGATGATCACCACAGCTGTCATGAC 

CCACACCAAGCATTTTGATGCATATGGCAGGATGAAGGAAGTGCAGTATGAGATCTTCCGCTCGCTCATGTAdTGGATGAC 

CGTCCAGTATGATAACATGGGGCGAGTAGTGAAGAAGGAGCTGAAGGTAGGACCCTACGCCAATACCACTCGCTACTCCTA 

TGAGTATGATGCTGACGGCCAGCTGCAGACAGTCTCCATCAATGACAAGCCACTCTGGCGCTACAGCTACGACCTCAATGG 

GAACCTGCACTTACTGAGCCCTGGGAACAGTGCACGGCTCACACCACTACGGTATGACATCCGCGACCGCATCACTCGGCT 

GGGTGACGTGCAATACAAGATGGATGAGGATGGCTTCCTGAGGCAGCGGGGCGGTGATATCTTTGAGTACAACTCAGCTGG 

CCTGCTCATCAAGGCCTACAACCGGGCTGGCAGCTGGAGTGTCAGGTACCGCTACGATGGCCTGGGGCGGCGCGTGTCCAG 

CAAGAGCAGCCACAGCCACCACCTGCAGTTCTTCTATGCAGACCTGACCAACCCCACCAAGGTCACCCACCTGTACAACCA 

CTCCAGCTCTGAGATCACCTCCCTCTACTACGACTTGCAAGGACACCTCTTTGCCATGGAGCTGAGCAGTGGTGATGAGTT 

TTACATAGCTTGTGACAACATCGGGACCCCTCTTGCTGTCTTTAGTGGAACAGGTTTGATGATCAAGCAAATCCTGTACAC 

AGCCTATGGGGAGATCTACATGGATACCAACCCCAACTTTCAGATCATCATAGGCTACCATGGTGGCCTCTATGATCCACT 

CACCAAGCTTGTCCACATGGGCCGGCGAGATTATGATGTGCTGGCCGGACGCTGGACTAGCCCAGACCACGAGCTGTGGAA 

GCACCTTAGTAGCAGCAACGTCATGCCTTTTAATCTCTATATGTTCAAAAACAACAACCCCATCAGCAACTCCCAGGACAT 

CAAGTGCTTCATGACAGATGTTAACAGCTGGCTGCTCACCTTTGGATTCCAGCTACACAACGTGATCCCTGGTTATCCCAA 

ACCAGACATGGATGCCATGGAACCCTCCTACGAGCTCATCCACACACAGATGAAAACGCAGGAGTGGGACAACAGCAAGGT 

AATTCCTGCACAAGGCTGCCAGTCTATCCTCGGGGTACAGTGTGAAGTACAGAAGCAGCTCAAGGCCTTTGTCACCTTAGA 

ACGGTTTGACCAGCTCTATGGCTCCACAATCACCAGCTGCCAGCAGGCTCCAAAGACCAAGAAGTTTGCATCCAGCGGCTC 

AGTCTTTGGCAAGGGGGTCAAGTTTGCCTTGAAGGATGGCCGAGTGACCACAGACATCATCAGTGTGGCCAATGAGGATGG 

GCGAAGGGTTGCTGCCATCTTGAACCATGCCCACTACCTAGAGAACCTGCACTTCAGCATTGATGGGGTGGATACCCATTA 

CTTTGTGAAACCAGGACCTTCAGAAGGTGACCTGGCCATCCTGGGCCTCAGTGGGGGGCGGCGAACCCTGGAGAATGGGGT 

CAACGTCACTGTGTCCCAGATCAACACAGTACTTAATGGCAGGACTAGACGCTACACAGACATCCAGCTCCAGTACGGGGC 

ACTGTGCTTGAACACACGCTACGGGACAACGTTGGATGAGGAGAAGGCACGGGTCCTGGAGCTGGCCCGGCAGAGAGCCGT 

GCGCCAAGCGTGGGCCCGCGAGCAGCAGAGACTGCGGGAAGGGGAGGAAGGCCTGCGGGCCTGGACAGAGGGGGAGAAGCA 

GCAGGTGCTGAGCACAGGGCGGGTGCAAGGCTACGACGGCTTTTTCGTGATCTCTGTCGAGCAGTACCCAGAACTGTCAGA 

CAGCGCCAACAACATCCACTTCATGAGACAGAGCGAGATGGGCCGGAGGTGACAGAGAGGACCAAGGACTTCTTGCCAAAG 

ACAGCTACTCTTTT 
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The disclosed NOV1 nucleic acid sequence, localized to chromsome 11, and has 3182 
of 3865 bases (82%) identical to &Mus musculus Ten-m4 mRNA (gbrGENBANK- 
ID:AB025413|acc:AB025413.1) (E = 0.0). Similiarity information was assessed using public 
nucleotide databases including all GenBank databases and the GeneSeq patent database. 
Chromosome information was assigned using OMIM and the electronic northern tool from 
Curatools to derive the the chromosomal mapping of the SeqCalling assemblies, Genomic 
clones, and/or EST sequences that were included in the invention. 

In all BLAST alignments herein, the u E-value ? ' or "Expect" value is a numeric 
indication of the probability that the aligned sequences could have achieved their similarity to 
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the BLAST query sequence by chance alone, within the database that was searched. For 
• example, the probability that the subject ("Sbjct" ) retrieved from the NOV1 BLAST analysis,, 
e.g., Mus musculus Ten-m4 mRNA, matched the Query NOV1 sequence purely by chance is 
0.0, The Expect value (E) is a parameter that describes the number of hits one can "expect 1 ' to 
5 see just by chance when searching a database of a particular size. It decreases exponentially 
with the Score (S) that is assigned to a match between two sequences. Essentially, the E value 
describes the random background noise that exists for matches between sequences. 

The Expect value is used as a convenient way to create a significance threshold for 
reporting results. The default value used for blasting is typically set to 0.0001. In BLAST 2.0, 

10 'the Expect value is also used instead of the P value (probability) to report the significance of 
matches. For example, an E value of one assigned to a hit can be interpreted as meaning that 
. in a database of the current size one might expect to see one match with a similar score simply 
by chance. An E value of zero means that one would not expect to see any matches with a 
similar score simply by chance. See, e.g., 

1 5 http://www.ncbi.nlm.nih.gov/Education/BLASTinfo/. Occasionally, a string of X's or N's 
will result from a BLAST search. This is a result of automatic filtering of the query for low- 
complexity sequence that is performed to prevent artifactual hits. The filter substitutes any 
low-complexity sequence that it finds with the letter "N" in nucleotide sequence (e.g., 
"NNNNNNNN") or the letter "X" in protein sequences (e.g., "XXX"). Low-complexity 

20 regions can result in high scores that reflect compositional bias rather than significant position- 
by-position alignment. Wootton and Federhem Methods Enzymol 266:554-571, 1996 . 

A NOV1 polypeptide (SEQ ID NO:2) encoded by SEQ ID NO.T has 2794 amino acid 
residues and is presented using the one-letter code in Table IB. Signal P, Psort and/or 
Hydropathy results predict that NOV1 does not contain a signal peptide and is likely to be 

25 localized to the mitochondrial inner membrane with a certainty of 0.8363, to the plasma 
membrane with a certainty of 0.6500 and to the nucleus with a certainty of 0.6000. 

Table IB. Encoded NOV1 protein sequence (SEQ ID NO:2). 

MDVKERKPYRSLTRRRDAERRYTSSSADSEEGKAPQKSySSSETLKAYDQDARLAYGSRVKDIVPOEAEEFCRTGANFTLR 
ELGLEEVTPPHGTLYRTDIGLPHCGYSMGAGSDADMEADTVLSPEHPVRLWGRSTRSGRSSCLSSRANSNLTLTDTEHENT 
ETDHPGGLQNHARLRTPPPPLSHAHTPNQHHAASINSt.NRGNFTPRSNPSPAPTDHSLSGEPPAGGAQEPAHAQENWLLiNS 
NIPLETRNLGKQPFLGTLQDNLIEMDILGASRHDGAYSDGHPLFKPGGTSPIjFCTTSPGYPLTSSTVYSPPPRPLPRSTPA 
WPAFNIiKKPSKYCNWKCAALSAI VI SATLVI LLAYFVAMHLFGLNWHLQPMEGQMYEI TEDTASSWP VPTDVSLYPSGGTG 
LETPDRKGKGTTEGKPSSFFPEASFIDSGEIDVGRRASQKIPPGTFWRSQVFIDHPVHLKFNVSLGKAALVGIYGRKGLPP 
SHTQFDFVELLDGRRLLTQEARSLEGTPRQSRGTVPPSSHETGFIQYLDSGIWHIAFYNDGKESEVVSFLTTAIIiDSWALC 
LGDGECVSGTCHCFPGFLGPDCSRAACPVLCSGNGQYSKGRCLCFSGWKGTECDVPTTQCIDPQCGGRGICIMGSCACNSG 
YKGESCEEADCIDPGCSNHGVCIHGECHCSPGWGGSNCEILKTMCPDQCSGHGTYLQESGSCTCDPNWTGPDCSNEICSVD 
CGSHGVCMGGTCRCEEGWTGPACNQRACHPRCAEHGTCRDGKCECSPGWNGEHCTISLAHYLDRWKLSEGCPGLCNGNGR 
CTLDLNGV^mCVCQLGWRGAGCDTSMETACGDSl<I)NDGDGL\7DCMDPDCCLQPLCHINPLCbGSPNPLDIIQETQVPVSQQN 
LHSFYDRIKFLVGRDSTHIlPGENPFDGGHACVIRGQVMTSDGTPIiVGVNISFVNNPLFGYTISRQDGSFDLVTNGGISII 
LRFERAPFITOEHTLWLPWDRFFVMETIIMRHEENEJPSCDLSNFARPNPVVSPSPIiTSFASSCAEKGPIVPEIOALQEEl 
SISGCKI-1RLSYLSSRTPGYKSVLRISLTHPTIPFNLMKVHLMVAVEGRLFRKWFAAAPDLSYYFIWDKTDVYNQKVFGLSE 
A^S VGYEYSSCPDLILWEKRTTVLQGYEIDASKL G^GW SLDKHHAIiNIQSGGlLHKGNGENQFVSQQPPVIGSlMGWGRRR 

" 10 

BNSDOCID: <WO_02055704A2_I_ s 



WO 02/05570-4 



PCT/US02/00554 



10 



15 



SISCPSCNGbADGNKLLiAPVALTCGSDGSLYVGDFNYIRRIFPSGNVTNILEIjRVRNKDFRHSHSPAHKYyLA'rDPMSGAV 
FLSDSNSRRVFKIKSTVWKDLVKNSEWAGTGDOCLPFDDTRCGDGGKATEATLTNPRGPPGITVDKFGLIYFVDGTMIR 
R I DQNG 1 1 S TLLG SNDLTS ARPLS CDS VMD I SQVRQVHLEWPTDIiAINPMDNSLYVLDNNVVIjQI SENHQVRI VAGRPMHC 
QVPGlDHFLLSKVAIHATbESATAIAVSHNGVliYIAETDEKKINRIRQVTTSGEISLVAGAPSGCDCKNDANCDCFSGDDG 
YAKDAKWJTPS SLAVCADGELYVADLGNI RI RFIR|CNKPFLNTQNMYE^SPIDQELYLFDTTGKHLYTQSLPTGDYLYNF 
TYTGDGDITLITDNNGNMVNVRRDSTGMPIjWLW 

TTFYEYDSFGRLTNVTFPTGQVSSFRSDTDSSVHVQVETSSKDDVTITTNLSASGAFYTLLQDQVRNSYYIGADGSLRJjIiL 
ANGMEVALQTE PHLLAGTVNPTVGKRNVTLP I DNGLNLVEWRQRKEQARGQVTVFGRRLRVLQVHNRNLLSLDFDRVTRTE 
KIYDDHRKFTLRILYDQAGRPSLWSPSSRLNGVNVTYSPGGYIAGIQRGIMSERMEYDQAGRITSRIFADGKTWSYTYLEK 
AGVQSMVLLLHSQRQYIFEFDKNDRLSSVTMPNVARQTLETIRSVGYYRNIYQPPEGNASVIQDFTEDGHLLHTFYLGTGR 
RVIYKYGKLSKLAETLYDTTKVSFTYDETAGMLKTINLQNEGFTCTIRYRQIGPLIDRQIFRFTEEGMVNARFDYNYDNSF 
RVTSMQAVINETPLPIDLYRYDDVSGKTEQFGKFGVIYYDINQIITTAVMTHTKHFDAYGRMKEVQYEIFRSLrf^VTMTVQY 
DNMGRWKKELK^GPYANTTRYSYEYDADGQLQTVSINDKPLWRYSYDLNGNLHLLSPGNSARLTPLRYDIRDRITRLGDV 
QYKMDEDGFLRQRGGDI FEYNSAGbLI KAYNRAGSWSVRYRYDGLGRRVSSKSSHSHHLQFFYADLTNPTKVTHLYNHSSS 
EITSLYYDLQGHLFAMELSSGDEFYIACDNIGTPIoAVFSGTGLMIKQILYTAYGEIYMDTNPNFQIIIGYHGGLYDPIiTKL 
VHMGRRDYDVIAGRWTSPDHELWKHLSSSNVMPFNLYM^ 

DAMEPSYELIHTQMKTQEWDNSKVIPAQGCQSILGVQCEVQKQLKAFVTLERFDQLYGSTITSCQQAPKTKKFASSGSVFG 
KGVKFALKDGRVTTDI I SVANEDGRRVAAI LNHAHYL.ENLHFTIDGVDTHYFVKPGPSEGDLAI LGLSGGRRTLENGVNVT 
VSQINTVLNGRTRRYTDlQLQYGALCLNTRYGTTIjDEEKARVLEIARQRAVRQAWAREQQRliREGEEGLRAWTEGEKQQVL 
STGRVQGYDGFFVISVEQYPELSDSANNIHFMRQSEMGRR . 



The NOV] amino acid sequence has 2394 of 2542 amino acid residues (94%) identical 
to, and 2445 of 2542 amino acid residues (96%) similar to, a Mus musculus 2771 amino acid 
residue TEN-M4 protein (ptnr:SPTREMBL-ACC:Q9WTS7) (E - 0.0). 

The disclosed NOV1 is expressed in at least the following tissues: Brain, Brain Stem, 
Coronary Artery, Heart, Kidney, Liver, Lymph node, Mammary gland/Breast, Ovary, Parietal 
Lobe, Pituitary Gland, Prostate, Synovium/Synovial membrane, Testis, Uterus and Whole 
Organism. This information was derived by detennining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, Public 
EST sources, Literature sources, and/or RACE sources. 

Possible small nucleotide polymorphisms (SNPs) found for NOV1 are listed in Table 
1C. For this and all following SNP tables, depth represents the number of clones covering the 
region of the SNP. The putative allele frequence (PAF) is the fraction of these clones 
containing the SNP. A dash, when shown, means that a base is not present. The sign ">" 
means "is changed to." Silent means that the indicated SNP does not cause a corresponding 
amino acid change. 



Table 1C: SNPs 


Variant 


Nucleotide 


Base 


Amino 


Base 




Position 


Change 


Acid 
Position 


Change 


13374261 


86 


A > G 


28 


Asp > Gly 


13374262 


194 


T>C 


64 


Val > Ala 


13374263 


229 


G > A 


76 


Ala > Thr 


13374264 


354 


G > A 


117 


Met > He 



NOVl has homology to the amino acid sequences shown in the BLASTP data listed in 
Table ID. 
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Table ID. BLAST results for NO VI 


Gene Index/ 
Identifier- 


Protein/ 
Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
-(%) 


Expect 


Rill6551957|dbi|BAB712 
06JJ(AKO56531) 


unnamed protein 
product [Homo 
sapiens] 


. , 730 


694/738 
(94%) 


695/738 
(94%) 


0.0 


Ri|7657417|rei1NP 03598 
7J1 (NM 01 1857) 


odd Oz/ten-m 
homolog 3 

(Drosophila); 

odd Oz/ten-m , 
homolog 1 

(Drosophila) 
[Mus musculus] 


2715 


1856/2809 
(66%) 


2204/2809 
(78%) 


0.0 


pi|1773773(9|reflNP 5242 
15Ji(NM_07949lV 


Tenasqin major; 
odd-Oz; odd Oz 

[Drosophila 
melanogaster] 


2515 


792/2469 
(32%) 


1279/2469 
(51%) 


0.0 


ei 18922444 IreflNP 06057 
4.11 (NM_ 018104) 


hypothetical 

protein 
FLJ 10474; 
hypothetical 
. protein 
FLJ 10886 
[Homo sapiens] 


1045 


707/1065 
(66%) , 

t 


863/1065 
(80%) 

i 


0.0 


eil7023205!dbi(BAA9187 
9JJ(AK001748) 


unnamed protein 
product [Homo 
sapiens] 


964 


660/985 
(67%) 


797/985 
(80%) 


0.0 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table IE. In the ClustalW alignment of the NOV1 protein, as 
well as all other ClustalW analyses herein, the black outlined amino acid residues indicate 
5 regions of conserved sequence {i.e., regions that may be required to preserve structural or 
functional properties), whereas non-highlighted amino acid residues are less conserved and 
can potentially be altered to a much broader extent without altering protein structure or 
function. 

Table IE. ClustalW Analysis of NOV1 

1) NOVl (SEQ ID NO:2) 

2) gi [l 655 1 957|dbi|B AB7 1 206. 1 1 (AK056531) unnamed protein product [Homo sapiens] (SEQ ID NO:51) 

2) eil76574 1 7|reflNP 035987.2( (NM_01 1857) odd Oz/ten-m homolog 3 (Drosophila); odd Oz/ten-m homolog 
1 (Drosophila) [Mus musculus] (SEQ ID NO:52) 

3) gi[17737739lreflNP 524215.31 (NM_079491) Tcnascin major; odd-Oz; odd Oz [Drosophila melanogaster] 
(SEQ ED NO:53) 

4) gil8922444fref]NP 060574. 1| (NM_0 1 8 1 04) hypothetical protein FLJ1 0474; hypothetical protein 
FLJ10886 [Homo sapiens] (SEQ ID NO:54) 

5) gil7023205ldbilB AA9 1 879. 1 1 (AJC001748) unnamed protein product [Homo sapiens] (SEQ ID NO:55) 

10 20 30 40 50 60 70 

I I I I I 1 I I I I i I i I 

NOV! MDVKERKPyRSLTRRRDAERRYTS--SSADSEEGKAPQKSySSSETLKAYDQDAK-LAYGSRVKDIVPQE 

gi |16551957| 

gi)76574l7] MDVKERRPYCSLTKSRREKERRYTNSSADNEECRVPTQKSYSSSETLKAFDHDYSRLIjYGNRVKDLVHRE 

gijl7737739l ■ 

gi|8922444| " 
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The presence of identifiable domains in NOVl, as well as all other NOVX proteins, 
was determined by searches using software algorithms such as PROSITE, DOMAIN, Blocks, 
Pfam, ProDomain, and Prints, and then determining the Interpro number by crossing the 
domain match (or numbers) using the Interpro website (http:www.ebi.ac.uk/ interpro). 
DOMAIN results for NOVl, as disclosed in Tables IF, were collected from the Conserved 
Domain Database (CDD) with Reverse Position Specific BLAST analyses. This BLAST 
analysis software samples domains found in the Smart and Pfam collections. For Tables IF 
and all successive DOMAIN sequence alignments, fully conserved single residues are 
indicated by black shading or by the sign (|) and "strong" semi-conserved residues are 
indicated by grey shading or by the sign (+). The "strong" group of conserved amino acid 
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residues may be any one of the following groups of amino acids: STA, NEQK, NHQK, 
NDEQ,QHRK,MILV,MIIJ;'HY,FYW. ' ' 1 

Table IF list the domain description from DOMAIN analysis results against NOV1 . 
This indicates" that the NOV1 sequence has properties similar to those of other proteins known 
to contain these domains. 



Table IF Domain Analysis of NOV1 

gnl | Pf am[ pf amOlSOO , K<eratin_B2, Keratin, high sulfur B2' protein. High' 
sulfur proteins are cysteine-rich proteins synthesized during the 
differentiation of hair matrix cells, and form hair fibers in 
association with hair keratin intermediate filaments. This family has 
been divided up into four regions, with the second region containing 8 
copies of a .short repeat. This family is also known as B2 or KAP1 . 
(SEQ ID NO :56) 

Length = 144 residues, 97.2% aligned 

Score = 41.6 bits (96), Expect - 6e-04 



NOV1 : 


624 


01500 : 
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NOV1 : 
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01500 : 


49 
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01500: 


99 



TTOCIDP0CGGRGICIMGSCACNSGYKGESCEEADCIDPGCSNHGVC1HGECHCSPGWGG 6 83 

1+ I M I I I + II + I I II II 
TSCCGFPTCSTLGTCGSSCC QPPSCCQPSCCQPVCSQTTC CRPTCFO 4 8 

SNCEI LKTMCPDQCSGHGTYLQESGSCTCDPNWTGPDCSNEI CSVDCGSHGVCMGGTCR- 74 2 

1+1*11 I + II. + IN 

SSC-CRPSCCOTSC CQPTCCQSSSCQTGCGIGSCRTRWCRPDCRVEGTCLP 98 

- - CEEGWTGPACNQRACHPRCAEHGTCRDGKCE CS PGWNGEHCT 784 

I III II 1+ III II II 

PCCWSCTPPTC CQPVSAQASCCRPSYCGQSCCRPACCCFPCT 141 

The Drosophila gene ten-m/odz is the Only pair rule gene identified to date which is not 
a transcription factor. In an attempt to analyze the structure and the function of ten-m/odz in 

10 mouse, four murine ten-m cDNAs which code for proteins of 2,700-2, 800 amino acids were 
isolated. All four proteins (Ten-ml-4) lack signal peptides at the NH2 terminus, but contain a 
short hydrophobic domain characteristic of transmembrane proteins, 300-400 amino acids 
after the NH2 terminus. About 200 amino acids COOH-terminal to this hydrophobic region 
are eight consecutive EGF-like domains. Cell transfection, biochemical, and 

15 electronmicroscopic studies suggest that Ten-ml is a dim eric type II transmembrane protein. 
Expression of fusion proteins composed of the NH2-terminal and hydrophobic domain of ten- 
ml attached to the alkaline phosphatase reporter gene resulted in membrane-associated 
staining of the alkaline phosphatase. Electronmicroscopic and electrophoretic analysis of a 
secreted form of the extracellular domain of Ten-ml showed that Ten-nil is a disulfide-linked 

20 dimer and that the dimerization is mediated by EGF-like modules 2 and 5 which contain an 

odd number of cysteines. Northern blot and immunohistochemical analyses revealed 

widespread expression of mouse ten-m genes, with most prominent expression in brain. All 

four ten-m genes can be expressed in variously spliced mRNA isoforms. The extracellular 

domain of Ten-ml fused to an alkaline phosphatase reporter bound to specific regions in many 
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tissues which were partially overlapping with the Ten-ml irmnunostairiing. Far Western 
assays and electronmicrAscopy demonstrated that Ten-ml can bind to itself. 

Zebrafish ten-'m3 and teri~m4 encode proteins highly similar to theproduct of 
Drosophila pair-rule gene ten(m)/odd Oz (odz). Their products contain eight epidermal growth 
5 factor (EGF)-like repeats that resemble mostly those of the extracellular matrix molecule 
tenascin. During segmentation period, ten-m3 is expressed in the somites, notochord, 
pharyngeal arches, and the brain, while expression of ten-m4 is mainly restricted to the 1 brain. 
In the developing brain, ten-m3 and ten-m4 expression delineates several compartments. 
Interestingly, ten-m3 and ten-m4 s^ow expression patterns complementary to each other in the 

10 developing forebrain and midbrain along both rostrocaudal and dorsoventral axes, depending 
on developmental stages and locations. 

The above defined information forNOVl suggests that theNOVT protein may 
function as a member of a family of novel TEN-M4-like proteins. Therefore, the NOV1 
nucleic acids and proteins of the invention are useful in potential' therapeutic applications 

15 implicated in various diseases and disorders described below and/or other pathologies. For 
example, the NOV1 compositions of the present invention will have efficacy for treatment of 
patients suffering from cancer, trauma, regeneration (in vitro and in vivo), 
viral/bacterial/parasitic infections, Endocrine dysfunctions, .Diabetes, obesity, Growth and 
reproductive disorders, Von Hippel-Lindau (VHL) syndrome, Alzheimer's disease, Stroke, 

20 Tuberous sclerosis, hypercalceimia, Parkinson's disease, Huntington's disease, Cerebral palsy, 
Epilepsy, Lesch-Nyhan syndrome, Multiple sclerosis, Ataxia-telangiectasia, 
Leukodystrophies, Behavioral disorders, Addiction, Anxiety, Pain and/or Neuroprotection, 
The NOV1 nucleic acid encoding TEN-M4-like proteins, and the TEN-M4-like proteins of the 
invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 

25 presence or amount of the nucleic acid or the protein are to be assessed. 

NOV2 

A disclosed NOV2 nucleic acid of 1024 nucleotides (also referred to as 
20422974J3_132_dal) encoding a novel semphorin-like protein is shown in Table 2A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 33- 
30 35 and ending at nucleotides 1022-1024. Putative untranslated regions upstream from the 

initiation codon and downstream from the termination codon are underlined in Table 2A, and 
the start and stop codons are in bold letters. 
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Table 2A. NOV2 Nucleotide Sequence (SEQ ID NO:3). 

CTGGCCTGAAGCTCAGAGCCGGGGCGTGCGCCA TGGCCCCACACTGGGCTGTCTGGCTGCTGGCAGC7VRGGCTGTGGGGCC 
TGGGCATTGGGGCTGAGGTGTGGTGGAACCTTGTGCCGCGTAAGACAGTGTCTTCTGGGGAGCTGGCCACGGTAGTACGGC 
GGTTCTCCCAGACCGGCATCCAGGACTTCCTGACACTGACGCTGACGGAGCCCACTGGGCTTCTGTACGTGGGCGCCCGAG 
AGGCCCTGTTTGCCTTCAGCATGGAGGCCCTGGAGCTGCAAGGAGCGATCTCCTGGGAGGCCCCCGTGGAGAAGAAGACTG 
AGTGTATCCAGAAAGGGAAGAACAACCAGACCGAGTGCTTCAACTTCATCCGCTTCCTGCAGCCCTACAATGCCTCCCACC 
TGTACGTCTGTGGCACCTACGCCTTCCAGCCCAAGTGCACCTACGTCAACATGCTCACCTTCACTTTGGAGCATGGAGAGT 
TTGAAGATGGGAAGGGCAAGTGTCCCTATGACCCAGCTAAGGACCATGCTGGCCTTCTTGTGGATGGTGAGCTGTACTCGG 
CCACACTCAACAACTTCCTGGGCACGGAACCCATTATCCTGCGTAACATGGGGCCCCACCACTCCATGAAGACAGAGTACC 
TGGCCTTTTGGCTCAACGGGGAGCGGGCAGTGGAGTCCGACTGCTATGCCGAGCAGGTGGTGGCTCGTGTGGCCCGTGTCT 
GCAAGGGCGATATGGGGGGCGCACGGACCCTGCAGAGCAAGTGGACCACGTTCCTGAAGGCGCGGCTGGCATGCTCTGCCC 
CGAACTGGCAGCTCTACTTCAACCAGCTGCAGGCGATGCACACCCTGCAGGACACCTCCTGGCACAACACCACCTTCTTTG 
GGGTTTTTCAAGCACAGTGGGGTGACATGTACCTGTCGGCCATCTGTGAGTACCAGT^GGAAGAGATCCAGCGGGTGTTTG 
AGGGCCCCTATAAGGAGTACCATGAGGAAGCCCAGAAGTGGGACCGCrACAC 



The disclosed NOV2 polypeptide (SEQ ID NO:4) encoded by SEQ ID NO:3 has 805 
amino acid residues and is presented in Table 2B using the one-letter amino acid code. Signal 
P, Psort and/or Hydropathy results predict that NOV2 does not contain a signal peptide and is 
5 1 likely to be localized to the mitochondrial inner membrane with a certainty of 0.8000 and to 
the plasma membrane with a certainty of 0.7000. The NOV2 polypeptide has an estimated 
molecular weight of 89454.5. 



Table 2B. Encoded NOV2 protein sequence (SEQ ID NO:4). 



MAPHWAVWLLAARIjWGLGIGAEVWWNIjVPRKTVSSGE 

EL.QGAISWEAPVEKKTECIQKGKNNQTECFNFIRFLQPYNASHLYVCGTyAFQPKCTYVNMLTFTLEHGEPEDGKGKCPYD 

PAKDHAG LLVDGEliY S ATLNNFLGT E P 1 1 LRNMG PHH S MKT E YIAFW LNGERAVE SDC Y AEQWARVAR VCKGDMGG ARTL 

QRKWTTFbKARLACSAPNWQLYFNQLQAMHTLQDTSWHNTTFFGVFQAQWGDMYLSAICEYQIjEEIQRVFEGPYKEYHEEA 

QKWDRYTDPVPSPRPGSCINNWHRRHGYTSSLELPDNILNFVKKHPLMEEQVGPRWSRPLLVKKGTNFTHLVADRVTGIjDG 

ATYTVLFIGTGDGWLLKAVSLGPWWLIEELQLFDQEPMRSL.VLSQSKKLLFAGSRSQLVQLPVADCMKYRSCADCVLARD 

PYCAWSVNTSRCVAVGGHSGSLLIQHVMTSDTSGIC^LRGSKKVR^ 

PAEQPGSFLYDARLQALVVMAAQPPJiAGAYHCFSEEQG 

LVLIiLLVLSLRRRLREELEKGAKATERTLVYPLELPKEPTSPPFRPCPEPDEKLWDPVGYYYSDGSIiKIVPGHARCQPGGG 
PPSPPPGIPGQPLPSPTRLHLGGGRNSNANGYVRLQLGGEDRGGLGHPIiPEIiADELRRKLQQRQPLPDSNPEESSV 



NOV2 has homology to the amino acid sequences shown in the BLASTP data listed in 
10 Table 2C. 



Table 2C. BLAST results for NOV2 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
{%) 


Positives 
(.%) 


Expect 


gi|l2698023 | db j j BAB 


KIAA1739 protein 
iHomo sapiens] 


963 


697/801 
(87%) 


699/801 
(87%) 


0.0 


21830. lj (AB051526) 


gi|8134699| sp|Q6415 


Semaphorin 4C 

precursor 
(Semaphorin I) 
(Sema I) 
(Semaphorin C-like 
1) [Mus musculus] 


834 


637/834 
(76%) 


677/834 
(80%) 


0 . 0 


1 |SM4C MOUSE 
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gi J 1702834 6 jqblAAHO. 


sema domain, 
immunoglobulin 
' ' domain (Ig) , 
transmembrane 
' domain TM) and 
short cytoplasmic. . 
domain, 
(semaphorin) 4C 
[Homo sapiens] 


510 


452/510 
(88%) , 


452/510 
(88%). 


0.0 


7476.1|AAH17476 
(BC017476) 


gi|8923346|ref |NP 0 


sema domain, 
immunoglobulin 
domain (Ig) , 
transmembrane 
domain TM; sema 

domain , 
immunoglobulin 
domain (Ig) , 
• transmembrane 
domain (TM) and 
short cytoplasmic 
domain, 
(semaphorin) 4C 
[Homo sapiens] 


510 


451/510 
, (88%) 

I 


451/510 
(88%) 


0.0 


60259. ll 
(NM_017789) 


gi | 7305471 jref |NP 0 


sema domain, 
immunoglobulin 
domaiin' (Ig) , 
transmembrane 
domain (TM) and 
' short cytoplasmic 
domain, 
(semaphorin) 
4D; M-sema G; 
semaphorin H [Mus 
musculus) 


. 861 


263/615 
(42%) 


346/615 
(55%) 

i 


e-124 


38688 . 1 | 
(NM 013660) 



The homology between these and other sequences is shown graphically in the 
ClustalW analysis shown in Table 2D. 



Table 2D. ClustalW Analysis of NOV2 

1) Novel NOV2 (SEQIDNO:4) 

2) gi|12698Q23|d bilB AB21830.il (AB051526)'KJAA1739 protein [Homo sapiens] (SEQ ID NO:57) 

3) gj|8 1 34 699lsp|Q64 1 5 1 )SM4C_MOUSE Semaphorin 4C precursor (Semaphorin I) (Sema I) (Semaphorin C- 
like 1) [Mus musculus] (SEQ ID NO:58) 

4) gill 7Q28346|gblAAH \ 1416, 1 1AAH1 7476 (BC017476) sema domain, immunoglobulin domain (Ig), 
transmembrane domain TM) and short cytoplasmic domain, (semaphorin) 4C [Homo sapiens] (SEQ ID NO:59) 

5) gi[8923346|ref|NP_060259.11 (NM_017789) sema domain, immunoglobulin domain (Ig), transmembrane 
domain TM; sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 
domain, (semaphorin) AC [Homo sapiens] (SEQ ED NO:60) 

6) gi[730547 1 IreflNP 038688.11 (NMJH3660) sema domain, immunoglobulin domain (Ig), transmembrane 
domain (TM) and short cytoplasmic domain, (semaphorin) 4D; M-sema G; semaphorin H [Mus musculus] (SEQ 
ID NO:61) 



10 20 30 40 50 60 70 

— I — I — ! — I — !••••! — I ! — I — I — I — I — I — I 

NOV2 

gi 1 12 698023 j LCSHLWQPGLGSCWSEGFPEAGSTHSRLCLLLCWTLIEAVGSRAKKEAAAEEAKVGWGCPALRPEVPLTL 

gi|B134699| — , 

gi|l7028346| 

gi|8923346[ 

gi|730547l| 

80 90 100 110 120 130 140 

....|....|....|....|....|....|....|....|....|....|....| |....|....| 

NOV2 MAgHWAVWLLAARLWG 

gi 1 12698 023 | RARA I S LMASSGRKLWLR YPS FLPAA WI CLLPGWERIjGR PRWGCQGQRLFQKCPLLgl RGFGWHIjIj VAWG 
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Si | 8134699 | 
gi 1 17028346 I 
gi |8923346 J 
gi|7305471 j 



NOV2 

gi|l2698023 
gi|8134699| 
gi(l7028346 
gi j 8923346 | 
gi{730547l| 



NOV2 

gi|l2698023 
gi|8134699| 
gi j 17028346 
igi(8923246 j 
gi|7305471 | 



NOV2 

gijl2698023 
gi|8!34699| 
gi|l7028346 
gi|8923346| 
gi]7305471 | 



NOV2 

gi|l2698023 
gi |8134699| 
gi|l7028346 
gi j 8923346 | 
gij7305471 j 



NOV2 

gi (12698023 
gi | 8134699 | 
gi J17028346 
gi j 8923346 | 
gi j 7305471 j 



NOV 2 

gi (12698023 
gi|8134699 j 
gi|l7028346 
gi j 8923346 | 
gi|7305471 \ 



NOV2 

gi {12698023 
gi|8134699l 
gi [17028346 
gi|8923346| 
gi | 7305471 | 



NOV2 

gi (12696023 
gi | 8134699 | 
gi) 17028346 
gi | 8923346 | 
gi (7305471 ( 



- MAgHWAVWLLAAGLWG 



160 



170 



150 

I I. I I I 

LGIGAEVWWNXiVPRKTVSS GELATV,; 

AGSRGARLRAV,EPOGSCPSAAMLTPAELA' 
LGIGAEMWWNLVPRKTVSS GELVT 1 



MRMCAgVRGLFLALWVLR 

190 '. 200 210 




TAVAFAPVPRLTWEHGEVG SvcjH;KP^Fj^;SABLMSgDKDT[j 

220 230 240 , 250 260 




gGATSffiEAP VEggTEgl Q 
AXsReA PVEygTEg'IQf 

;A3:s5Seapae!Si 




270 280 



jTyVNMLTgTL.EH 
II/tEtLEH 
iT.YINMLTgTLDR 



is|k®he'v^iwsep[2Ss; 

290 300 



LYVCGTjfflAFQP 



r^HgNJjT^KFLG 
340 350 





s-sgsp; 

400 



IPgBNEPSFVFAD 
410 420 



yv pe s vg s ftgddd kvy f f fr^ra^sdc y aeqywi 
fvpesvgsftgdddkiyfffs@r^SSyp,cys!eqvV^ 



RVARVCKGD KsgHrTLQ 
RVARVCKGD EggBrTIiQ 
RVARVCKGD SGGaRTLQ 


gKWT^FLKARllJcsMp 
gKWTHFIiKARlScSgP 

SkwtHflkarlScsSp 





VI QKS PDGPEGEDDKVYFFFTgvfeE^ FVF ^ L B p EEES3SS0SEj 



440 



450 



430 

W0LYj^0ff0AMHTflODTSWHNTTf|FGggQAQl 
WQLYHNQg.QAl^TgQDTSWHNTTgFGB5QAQW| 
WKVYfjMo"KAVHT",RGASWHNTT{jFG^OARl 



SGgv§NljjQDVFVjj^ 




SRGKYMQSATVEQSHT[22i 
550 



560 
•I 





670 
| - • 



690 



700 



ghsgslliqhvmtsdtsgicjglrgskkvrp, 
ghsgslliqhvmtsdtsgicSlrgskkvrpi 



GHSGSLLIQHVMTSDTSGiqgLRGSKKVRP 1 
gHSG SLLIQHVM TSDTSGIcSliRGSKKVRP^ 



'IjHOEEASgRGWIQqaSGlttS SCLDgSRES 
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770 



NOV2 

gi |l2698023 | 
gi I 8134699 j 
gi|l7028346 | 
gi [8923346 f 
gi [ 7305471 | 



NOV2 

gi jl2698023 j 
gi j 8134699 j 
gi|l7028346| 
gi j 8923346 | 
gi | 7305471 [ 



HOV2 

Qi (12698023 J 
gi | 8134699 | 
gi | 17028346 j 
gi | 8923346 | ' 
gi.j 7305471 | 



N0V2 

gi | 12698023 | 
gi | 8134699 | 
gi | 17028346 j 
gi j 8923346 | 
gi j 7305471 j 




1000 



N0V2 

gi|l2698023 [ 
gi j 8134699| 
gi|l7028346| 
gi j 8923346 | 
gij 7305471| 



g2\RDKPFDVKCELrKFADSDADGD 



Tables 2E and 2F list the domain description from DOMAIN analysis results against 
NOV2. This indicates that the N0V2 sequence has properties similar to those of other 

5 proteins known to contain these domains. 

I ~ i 

Table 2E. Domain Analysis of NOV2 

gnl ] Pf am [ pf amQ1403 , Sema, Sema domain. The Sema domain occurs in 
semaphorins, which are a large family of secreted and transmembrane 
proteins, some of which function as repellent signals during axon 
guidance. Sema domains also occur in the hepatocyte growth factor 
receptor. (SEQ ID NO: 62) 
Length ~ 433 residues, 97.0% aligned 

Score = 405 bits (1040), Expect = 6e-ll4 

NOV2: 6 6 
01403; 14 
NOV2 : 124 
01403: 74 
N0V2: 182 
01403: 134 



YVGARE7ALFAFSMEALE- -LQGAISWEAPVEKKTECIOKGKNNQTECFNFIRFLQPYNAS 12 3 

Mill ++I I I I I II III* III Mil II II + 

yVGARNRVYVLNLEDLSEVLNLKTGWPGS CETCEECNMKGKSPLTECTNFI RVLQAYWDT 73 
HLYVCGTYAFQPKCTYVNMLT- FTLEHGEFEDGKGKCPYDPAKDHAGLLV-DGELYSATL 181 

IMIIII mi II +1+ 1 + 1+ III I lllll + +11 Hill 1 + 

HLYVCGTNAFQPVCTLINLGDLFSLDVDNEEDGCGDCPYDPLGNTTSVLVQGGELYSGTV 13 3 

NNFLGTEPI I LRNMGPHHSMKTE - YLAFWLNG ERA 215 

+ 1 I +1 I I +1 I ++M + + III M 
IDFSGRDPSIRRLLGSHDGLRTEFHDSKWLNLPNFVDSYPIHYVHSFSDDKVYFFFRETA 193 
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N0V2: 


216 


014 03 : 


194 


NOV2: 


273 


01403 : 


253 


NOV2 : 


333 


01403 : 


312 


NOV2 : 


3 92 


01403 : 


364 


NOV2 : 


444 


01403 : 


424 



+ ■+• +u run i n 



n 



HTLQDTSWHNTTFFGVFOAQWGDMYLSAICEyQL.EEIQRVFEGPyKEYHEEAOKWDRyTD 

'i + + +ni n+i + + +i + ijin + i II i 

FVLPTGADTDPVL.YGVFTTSSNSSAGSAVCAFSMSDINOVFEGPFKHQSPN-SKWLPYRG 

PVPSPRPGSCINNWHRRHGYTSSLELPDNILNFVKKHPLMEEQVGPRWSRPLLVKKGTN- 

I I Mil I I I I 111+ I I I++ I I I l+l I I +111+ I 

KVPQPRPGQCPNA SGLNLPDDTLNFI RCHPLMDEWP PLHNVPLFVGQSGNY 

- FTHIiVADRVTGLDGA'irYTVLFIGTGDGWLLKAVSLG PWVHL I EELQLF - DQE P 

I + III II "11111 + 11 II +11 I I I ++I I +1 I I I 

RLTSIAVDRVRAGDGQIYTVLFLGTDDGRVLKQWLSRSSSASYLVWLEESLVFPDGEP 



MRSLVLSQSK 
+ + +| + |' 
VQRMVISSKN 



453 
433 



252 
332 
311 
391 
363 
443 
423 



Table 2F. Domain Analysis of NOV2 

gnl I Smart J smart00630 , Sen^a, semaphorin domain (SEQ ID NO: 63) 
Length - 430 residues,' 97.0% aligned 
Score b 397 bits (1021) , Expect = le-'lll 



NOV2: 


66 


00630: 


14 


NOV2: 


125 


00630: 


74 


NOV2: 


185 


00630: 


134 


NOV2 : 


219 


00630: 


194 


N0V2: 


278 


00630: 


254 


NOV2 : 


337 


00630: 


314 


N0V2 : 


395 


00630: 


368 


NOV2: 


450 


00630: 


427 



YVGAREALFAFSMEALELQG-AI SWEAPVEKKTECIQKGKNNQTECFNFIRFLQPYNASH 124 

iiiii i* i+ + ii* 1+1 mi i in i 

WGARNRLYVLSLNLISEAEVKTGWLSSPDCEECVSKGKDPPTDCVNFIRLLLDYNADH 73 



i mi mi i + n i i+i+n+n + +1111111 n + i 

LLVCGTNAFQPVCRLINLGNLDRLEVGRESGRGRCPFDPQHNSTAVLVDGELYVGTVADF 13 3 



-SMKTE-YLAFWLNG-- ERAVES 218 

n+i i + iii i ill 



+ n+iniiii nn i+i +in+inin in ni+i 



+ ll+l + I +! II l+ll +1 I 



II 



l + l 



ii +111++111+1 nn+i i i in n +i i 

- SSKDLPDDVLNFI KTHPLMDEWQPLTGRPLFVKTDSNYLLTS 3 S 7 



ii ii inii+n II +11 I 



-WVHLIEELQLFDQ-EPMRSLVL 44 9 
+ +H++M |+ Ml 



450 SQSK 453 
I I 

427 SPKK 430 

Members of the collapsin/semaphorin gene family have been proposed to act as growth 
cone guidance signals in vertebrates and invertebrates. To identify candidate molecules 
involved in axonal pathfinding during mouse embryogenesis, cDNAs from five new members 
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of the semaphorin family (Sem A-Sem E) were isolated. The murine semaphorin genes are 
differentially expressed in mesodenn and neuroectoderm before and during the time when 
axons select their pathways in the embryo. In .explant cultures, recombinant Sem D/collapsin 
converts a matrix permissive for axonal growth into one that is inhibitory for neurites of 
5 peripheral ganglia. Studies demonstrate that semaphorins are a diverse family of molecules 
that may provide local signals to specify territories nonaccessible for growing axons. (Puschel 
et al., Murine semaphorin D/coll!apsin is a member of a diverse gene family and creates : 
domains inhibitory for axonal extension. Neuron 14(5):941-8, 1995). 

The above defined information for NOV2 suggests that this NOV2 protein may 

1 0 function as ,a member of the semaphorin protein family. Therefore, the NOV2 nucleic acids 
and proteins of the invention are useful in potential therapeutic applications implicated in 
various diseases and disorders and/or other pathologies. For example, the NOV2 
compositions of the present invention will have efficacy for treatment of patients suffering 
from various disQases and disorders. The NOV2 nucleic acid encoding semaphorin-like 

3 5 protein, and the semaphorin-like protein of the invention, or fragments thereof, may farther be 
useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 

NOV3 

A disclosed NOV3 nucleic acid of 1933 nucleotides (also referred to as CG50351-01) 
20 encoding a novel erythroid membrane-associated protein (ERMAP)-like protein is shown in 
Table 3 A. An open reading frame was identified beginning with a ATG initiation codon at 
nucleotides 100-102 and ending with a TAG codon at nucleotides 1837-1839. Putative 
untranslated regions upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 3A, and the start and stop codons are in bold letters. 



Table 3A. NOV3 Nucleotide Sequence (SEQ ID NO:5) 



aacttcaccccagccttgcaaagtacagtcacctagttggtgttgtaattgtgacttcaaaaggccattccaccattctg 
tcaatccagctgattcagt atgatggagaacatggtaaaaccagtgaattccatgagcatgagcccactgccgcacgtct 
ttagccgtaaagtgagtgctttggtcagaggcaatcctccatctccaggctctgctgccacttcaggcttttccttgtgg 
gtatctcagcctcctagccagccatggggggatgccggcaagttccacgtggccctactagggggcacagccgagctgct 
ctgccctctctccctctggcccgggacggtacccaaggaggtgaggtggctgcggtccccattcccgcagcgctcccagg 
ctgttcacatattccgggatgggaaggaccaggatgaagatctgatgccggaatataaggggaggacggtgctagtgaga 
gatgcccaagagggaagtgtcactctgcagatccttgacgtgcgccttgaggaccaagggtcttaccgatgtctgatcca 
agttggaaatctgagtaaagaggacaccgtgatcctgcaggttgcagcatttctgataaggaaggaactcctggtgtgca 
gatcggtgggatggttcccagagccctgggccaagtggagagaacctcaaggcagggtacttccatccctgtcagaggcc 
cactctctggaaaaagctgggctcttccaaatagcagtgtctagcagagtcagggacagcacactggggaatgtgtcctg 
caccatccacaacatggcccttggccaagagaagaccacagctgtggtcatatcagccccatctgtggggagtctctccc 
cctcagcagtggctctggctgtgatcctgcctgtcctggtacttctcatcatggtgtgcctttgccttatcaaggcagtg 
gcaaaaaatctcattatcttttcctttcttcctctagacaatcttctttcagaccatgctaaagaaaaaggtaatgatat 
aaaagtagggaacgtttcttctttcatagagttgaaaagagctgcagcaaactcaggctggagaagagcccggttgcatt 
ttg tgg cagtg a ccctgg acccagac acagcac at ccc aaactc atcc tttc tgaggaccaaag atgtgtaagg cttg ga 
gacagacggcagcctgtacctgacaacccccagagatttgatttcgttgtcagcatcctaggctctgagtacttcacgac 
tggctgccactactgggaggtgtatgtgggagacaagaccaaatggattcttggagtatgtagtgagtcagtgagcagga 
aggggaaggttactgcctcacctgccaatggacactggcttctgcgacagagtcgtgggaatgagtatgaagctctcaca 
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TCCCCGCAGACCTCCTTCCGCCTTAAAGAGCCTCCACGGTGTGTGGGGATTTTCCTGGACTATGAAGCAGGAGTCATCTC 
TTTCTACAATGTGACCAACAAGTCCCACATCTTTACTTTCACCra 

CTTGCCTTCATGATGGAGGAAAAAACACAGCACCTCTAGTCATTTGTT(^GAACTACACAAATCAGAGGAATCAATTGTC 
CCCAGGCCAGAAGGGAAAGGCCATGCTAATGGAGATGTGTCCCTCAAGGTGAACTCTTCTTTACTACCCCCGAAGGCCCC 
AGAGCTGAAGGATATAATCCTGTCCTTGCCCCCTGACCTTGGCCCAG^^ 
GATATGCCACATTACCTGCTCCCATCACCATCCAGCCra 

GGAACGTCTCTTC ' 

The disclosed NOV3 nucleic acid sequence maps to chromosome 13 and has 727 of 
892 bases (81%) identical to a Mus musculus G. domesticus erythroid membrane- associated 
protein ERMAP mRNA (gb:GENBANK-ID:AF153906|acc:Af 1 53906.1) (E * 3.7e 199 ). 
5 A disclosed NOV3 protein (SEQ ED NO:6) encoded by SEQ ID NO:5 has 579 amino 

acid residues, and is presented using the one-letter code in Table 3B. Signal P, Psort and/or 
'Hydropathy results predict that NQV3 does not contain a signal peptide, and is likely to be 1 
localized to the plasma membrane with a certainty of 0.7000. 

Table 3B, Encoded NOV3 protein sequence (SEQ ID NO:6). 

MMENMVKPVNSMSMSPLPHVFSRKVSALVRGNPPSPGSAATSGFSDWVSQPPSQPWGDAGKFHVALLGGTAELLCPLSLW 
PGTVPKEVRWLRSPFPQRSQAVHIFRDGKDQDEDLMPEYKGRTVljVRDAQEGSVTLQII.DVRLEDQGSYRCIiIQVGNLSK 
EDTVILQVAAFLIRKELLVCRSVGWFPEPWAKWREPQGRVLPSLSEAHSLEKAGLFQIAVSSRVRDSTLGI^VSCTIHNMA 
LGQEKTTAWISAPSVGSLSPSAVALAVILPVLVLLlMVCLCIil KAVAKNM I FSFL>PLDNLI>SDHAKEKGNDIKVGNVS 
SFIELKRAAANSGWRRARLHFVAVTLDPDTAHPKLILSEDQRCVRLGDRRQPVPDNPQRFDFWSILGSEYFTTGCHYWE 
VYVGDKTKWILGVCSESVSRKGKVTASPANGHWLLRQSRGNEYEALTSPQTSFRLKEPPRCVGIFLDYEAGVISFYNVTN 
KSHIFTFTHNFSGPLRPFFEPCLHDGGKNTAPLVICSELHKSEESIVPRPEGKGHANGDVSDKVNSSIiLPPKAPELKDII 
LSLPPDLGPALQELKAPSF 

10 The NOV3 amino acid sequence has 384 of 591 amino acid residues (64%) identical 

to, and 456 of 591 amino acid residues (77%) similar to, a Mus musculus 592 amino acid 
residue erythroid membrane-associated protein (ERMAP) (ptnr:SPTREMBL-ACC:Q9JLN5) 
(E = 2.2e 197 ). 

NOV3 is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 
15 Bone, Bone Marrow, Brain, Colon, Coronary Artery, Hair Follicles, Heart, Hippocampus, 
Kidney, Liver, Lung, Parathyroid Gland, Pituitary Gland, Prostate, Retina, Spinal Chord, 
Testis, Uterus and Whole Organism. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Genomic Clone sources, Literature sources, and/or 
20 RACE sources. In addition, NOV3 is predicted to be expressed in the following tissues 

because of the expression pattern of a closely related Mus musculus erythroid membrane- 
associated protein ERMAP mRNA homolog (GENB ANK-ID : gb : GENB ANK-ID : AF 1 53 906 1 
accrAFl 53906.1): Kidney, Liver and Lung. 

NOV3 has homology to the amino acid sequences shown in the BLASTP data listed in 
25 Table 3C. 

26 

BNSDOCID: <WO„..02055704A2J_> 



WO 02/055704 



PCT/US02/00554 



Table 3C. BLAST results for NOV3 


Gene Index/ 
Identifier 


Protein/ 
Organism 


Length 
(aa) ' 


Identity 
' ' <%> ' 


Positives 
(%) 


Expect . 


gi| 7305041 |ref|NP 0 


erythroblast 
membrane- 
associated 

protein [Mus 
musculus] 


592 


373/595 
(62%) 


440/595 
(73%) 


0.0 


38876 .1 1 
<NM_013848) 


gi | 17489129 j ref |XP 


erythroblast 
membrane- 
associated 
protein [Homo 
, sapiens] 


475 


289/360 
(80%) 

i 


301/360 
(83%) 


e-162 


002035.3 | 
(XM_002035) 


gi[l5808373 |qb|AAL0 


erythroid 
membrane- 
associated 
protein . [Homo 
sapiens] 


188 


188/188 
(100%) 


188/188 
(100%) 


e-107 


8411. 1| (AF311284) 


gi [15079315 |qb|AAHl 


Similar to 
butyrophilin, 
subfamily 1, 

member Al 
[Mus 

musculus] 


524 


209/511 
(40%) 


278/511 
(53%) 


le-85 


1497.1|AAH11497 
(BC011497) 


gi|7304935|ref iNP 0 


butyrophilin 
[Mus 
musculus] 


524 


209/511 
(40%) 


277/511 
(53%) 


2e-85 


38511 .1 j 
(NM_013483) 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 3D. 

Table 3D. ClustalW Analysis of NOV3 

1) NOV3 (SEQ ID NO:6) 

2) gil7305041)reflNP 038876.1) (NM_013848) erythroblast membrane-associated protein [Mus musculus] 
(SEQ ID NO:64) 

3) gill7489129lref)XP Q02035.3[ (XM_002035) erythroblast membrane-associated protein [Homo sapiens] 
(SEQ ID NO:65) 

4) gill 5 808373 |gb| AAL084 11.11 (AF31 1284) erythroid membrane-associated protein [Homo sapiens] (SEQ ID 
NO:66) 

5) gill 50793 1 5|g blAAH 1 1 497. 1 IAAH 1 1 497 (BC01 1497) Similar to butyrophilin, subfamily 1, member Al 
[Mus musculus] (SEQ ID NO:67) 

6) gi|73049351reflNP 03851 1.1 1 (NMJ)13483) butyrophilin [Mus musculus] (SEQ ID NO:68) 



NOV3 



gi 
gi 
gi 
gi 



7305041 I 
174B9129) 
15808373 j 
15079315) 
7304935 j 



10 20 30 40 50 €0 70 

- I — .L-,- 1 - 1 - l.= - - - 1 — ! — I — I — I — I — I.-,- -..-Xi- - - 1 

MMEIWKPVNSMSMSPkPHVFSRKV^ALVR^^ 

MLKRLKKHWAWKMgVMPH - - SRKM5 V:HMER0SPCGSWLVGCLFTIAVFQPPVQVLGDAGKVYt[HpXiRDT 

- ME^SSAGgwLsGgLIPLVFLR-LSVHVSG- -HA GDAG^FHvjJSLgGT 

IjAVPTNgci^L^LLTLTVLQIjPTDDSAAgFDVT APQEPViggvgSD 

- - -^VPTNgC^VgljlaTbTVLQLP^LDSAABFDVT APQEPVjJW.VgSD 



NOV3 

gi| 7305041 I 
gi|l7489l29) 
gi [15808373 j 
gi|l5079315 j 
gi J7304935 | 




110 



PG.TSVP K@V ; RgjLgS P P PQj^C^HI FgTOKTjgDEDL^pg 
gffiQDRD|EDLRpg 

|^spfpo^o|^hif SS kd^£DL|8p" 



YmPGHLP^TQ^HVF^WQDRDEDL^IPfgSl* 



StaRv: 
™tv5v: 



130 



VRD- AQE@SV{3qRl 
jVRD - AHKES YlHrjBs 
|VPJD - AQEgS vjjjQgb 



IPNASSEYMgLLffiFg OTgKgLLYgjgQEffiEGQQ^lTl 

jPNASSEYM§LLBFg QTfj^WiL^YlS^OEjSSGOl 

0 160 170 180 



ggATrjATAGLLDj 

hSatJataglli 
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NOV3 

gi|730504l| 
gi [17489123 | 
gi 15808373 | 
gi|15079315| 
gi|7304935 | 



NOV3 


gi 


7305041 { 


gi 


17489129 


gi 


15808373 


gi 


15079315 


gi 


7304935 | 



NOV3 

gi | 7305041 | 
gi J 17489129 | 
gi jl5808373 I 
gi|l50793l5| 
gi j 7304935 | 



NOV3 

gi j 7305041 1 
gi {17489129 J 
gi (15808373 j 
gi|l5079315 j 
gi j 7304935 | 



NOV3 

gi | 7305041 | 
gi (17489129 | 
gi|l5808373 | 
gi |15079315 j 
gij 7304935 { 



NOV3 

gi | 7305041 j 
gi jl7489129 | 
gi j 15808373 j 
gi|l5079315 j 
gi j 7304935 j 



UEf^rasgigJlgvgNLSKraDTffil g<ffiAF LgRKE Lfflv 




gEgTTAVjVX-SggsBjGSgSgSA^LSvi 



'PBTggSKKHNEEg 
!tB8sKKHNE : E? 



290 



^gEVESsiipgjpgpRgTgwljgvgl) 




300 
•I- 



310 



320 
I 



jPVgGLjjlil^IWLICjfo^ 

SpvBvi5i;mvcLcliw5orrakekl^ehvtevdn3lsdj 



fBtigsifptwS- 



iKER 

jKER 




340 
•I- 



350 



IDIK,-yGNV|SSFIEIjKRAAANSGW 
|RLHK^KKjjj3SELKLKRAAANAGW 
[iCAVKK^SELKLKRAAANSGW 



tKERLXiEEl 
KEgLkEEj 



N0V3 


RRAI 


gi|730504l| 




gi|l7489129| 


RRAI 


gi|l5808373 j 




gi|l5079315 | 


KKT\ 


gi J 7304935 | 




550 



§ 


m 


a 5d 


m 






\ tf 3 








131-' 







WNP 
VNS 
VNS 

pjjGEGCTSGD 
PBGEGCTSGD 



Tables 3E - 3H list the domain description from DOMAIN analysis results against 
NOV3. This indicates that the N0V3 sequence has properties similar to those of other 
proteins known to contain these domains. 
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Table 3E Domain Analysis of NOV3 

gnl 1 Smart 1 smart0044 9 , SPRY, Domain in SPla and the Ryanodine Receptor. 
Domain of unknown function. .Distant homqlogues ape .domains in 
butyrophilin/marenostrin/pyrin homologues. .{SEQ ID NO: 69) 
Length = 125 residues, 96.8% aligned 
Score = 97.1 bits (240), Expect *= 3e-21 



NOV3 : 


3 95 


00449 : 


2 


NOV3 : 


451 


00449 : 


61 


iNOV3 : 


511 


00449: 


117 



I ll+ll I lit I +1 ++'111 I + I . I +1 1+ 

GRHYFEVEVFTGGDKGHWRVGWATKSVPR-GGFRLLGEDKGSWGYDGDGGKKYHNSEFPE 60 



+ii +i iii iii mi + i inn i i * 

YGLPFQEPGDVI GCFLDLEAGTI SFYKNGKYLGLAFFDVTFSGPLYPAVSL GNGGS 116 



Table 3F Domain Analysis of NOV3 

gnl ] Pf am] pf am00622 , SPRY , SPRY domain. SPRY Domain is named from SPla 
and the RYanodine Receptor. Domain of unknown function. Distant 
homologues are domains in butyrophilin/marenostrin/pyrin homologues. 
(SEQ ID NO: 70) 

Length = 124 residues, 97.6% aligned , 

Score = 89.7 bits (221), Expect = 4e-l9 



NOV3 : 


394 


00622 : 


1 


NOV3 : 


451 


00622 : 


60 


NOV3 ; 


510 


00622: 


115 



-I 11+11 I I +1 ++11 II + 1 I ) I++I I 

SGKHYFEVEVDTGGEGHWRVGWATKSV- RKPGESLLGDNEGSWGFDGSGGSKYHNGTGED 5 9 



-I +1 lllllll III 1+ I + +1 Ml I 

YGLPFQEGDV- IGCFLDYEAGEISFTKNGKDLGI YAFRNVSFGGPLYPAV- 



510 TAPLVIC 516 



I 

-SLGSGE 114 



Table 3G Domain Analysis of NOV3 

gnl [ Smart [ smart00406 , IGv, Immunoglobulin V-Type (SEQ ID NO: 71) 
Length = 80 residues, 97.5% aligned 

Score = 49.7 bits (117), Expect = 5e-Q7 



NOV3 : 7 0 TAELLCPLSLWPGTVPKEVRWLRSPFPQRSQAVHIFRDGKDQDEDLMPEYKGRTVLVRDA 12 9 

+ i i i ii+ii + i i mi + +i 

00406: 1 SVTLSCKASG - FTFSSY YVSWVRQP - - PGKGLEWLGY I GSDVS YS - EAS YKGRVTI S KDN 56 

NOV3 : 13 0 QEGSVTLQI LDVRLEDQGSYRC 151 

+ l+l I ++I+II l+l I 
00406: 57 SKNDVSLTI SNLRVEDTGTYYC 78 
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Table 3H Domain Analysis of NO V3 

gnl | Smart 1 sroart004 09 , IG, Immunoglobulin (SEQ ID NO: 72) 

Length » 86 residues/ 94.2% aligned ' r ' 

Score, = 42.7 bits {99), Expect =' 6e~05 . • 

NOV3: 64 VALLGGTAELLCPLSLWPGTVPKEVRWLRSPFPQRSQAVHIFRDGKDQDEDLMPEYKGRT 123 

hill I I I h +11 

00409 : 5 TVKEGESVTLSCEAS- - -GNPPPTVTWYK QGGKLLAESGRF 42 

NOV3: 124 VLVRDAQEGSVTLQILDVRLEDQGSYRCLIQVGNLSKEDTVILQV 16 8 

+ I 1+ 1 1 I 1 1 l + M + f II ' : 

00409: 43 SVSRS - - GGNSTLTI SNVTPEDSGTYTCAATNSSGSASSGTTLTV 85 

The NOV3 gene sequence described herein encodes for a novel member of the 
Immunoglobulin superfamily of enzymes. Specifically, the sequence encodes a novel 
5 ERMAP-Iike protein which is related to the B7-class of immunoglobulins. B7 molecules play 
crucial roles in T-cell activation making them plausible targets for cancer, AIDS, and/or 
inflammation therapies. The protein described here may be useful in treating brain and CNS 
disorders; endocrine, inflammation and autoimmune disorders; pancreatic disorders and 
cancers including lung, pancreas, brain, and prostate. 

1 0 ERMAP, a gene coding for a novel transmembrane protein produced exclusively in 

erythroid cells, is described. It is mapped to murine Chromosome 4, 57 cM distal to the 
centromere. The initial cDNA clone was isolated from a day 9 murine embryonic erythroid 
cell cDNA library. The predicted peptide sequence suggests that ERMAP is a transmembrane 
protein with two extracellular immunoglobulin folds, as well as a highly conserved B30.2 

15 domain and several phosphorylation consensus sequences in the cytoplasmic region. ERMAP 
shares a high homology throughout the. entire peptide with butyrophilin, a glycoprotein 
essential for milk lipid droplet formation and release. A GFP-ERMAP fusion protein was 
localized to the plasma membrane and cytoplasmic vesicles in transiently transfected 293T 
cells. Northern blot analysis and in-situ hybridization demonstrated that ERMAP expression 

20 was restricted to fetal and adult erythroid tissues. ERMAP is likely a novel adhesion/receptor 
molecule specific for erythroid cells (Ye et aL, ERMAP, a gene coding for a novel erythroid 
specific adhesion/receptor membrane protein. Gene 242(l-2):337-45, 2000). 

The above defined information for NOV3 suggests that this NOV3 protein may 
function as a member of a erythroid membrane-associated protein (ERMAP) protein family. 

25 Therefore, the NOV3 nucleic acids and proteins of the invention are useful in potential 

therapeutic and diagnostic applications. For example, a cDNA encoding the NOV3 protein 
may be useful in gene therapy, and the NOV3 protein may be useful when administered to a 
subject in need thereof. By way of nonlimiting example, the compositions of the present 
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invention will have efficacy for treatment of patients suffering from brain disorders including 
epilepsy, eating disorders, schizophrenia, ADD, and cancer; heart disease; inflammation and 
autoimmune disorders including 1 Crohn's disease, IBD, allergies, rheumatoid and 
osteoarthritis, inflammatory skin disorders, blood disorders; psoriasis colon cancer, leukemia 
AIDS; thalamus disorders; metabolic disorders including diabetes and obesity; lung diseases 
such as asthma, emphysema, cystic fibrosis, and cancer; pancreatic disorders including 
pancreatic insufficiency and cancer; and/or prostate disorders including prostate cancer. The 
NOV3 nucleic acid encoding a erythroid membrane- associated protein (ERMAP)-like protein, 
and the erythroid membr ane-associated protein (ERMAP)-like protein of the invention, or 
fragments thereof, may further be useful in diagnostic applications, wherein the presence or 
amount of the nucleic acid or the protein are to be assessed. 

NOV4 

NOV4 includes two novel vitelline membrane outer layer, protein 1 precursor (VMO-I)- 
like proteins disclosed below. The disclosed proteins have been named NOV4a and NOV4b. 

NOV4a 

A disclosed NOV4a nucleic acid of 748 nucleotides (designated CuraGen Acc. No. 
133268995_dal) encoding a novel vitelline membrane outer layer protein I precursor (VMO- 
I)-like protein is shown in Table 4A. An open reading frame was identified beginning with an 
ATG initiation codon at nucleotides 63-65 and ending with a TGA codon at nucleotides 669- 
671. Putative untranslated regions upstream from the initiation codon and downstream from 
the termination codon are underlined in Table 4A, and the start and stop codons are in bold 
letters. 

Table 4A. NOV4a Nucleotide Sequence (SEQ ID NO:7) 

ccctcacagaggccaaactgatataaatctgcttaggaggcctgattcacagacgctacagg atggagcggggcgcagg 
agccaagctgctgccgctgctgctgcttctgcgggcgactggtttcacatgtgcacagacaggtggccggaacggctac 

ACGGCGGTCATCGAAGTGACCAGCGGGGGTCCCTGGGGCGACTGGGCCTGGCCTGAGATGTGTCCCGATGGATTCTTCG 
CCAGCGGGTTCTCGCTCAAGGTGGAGCCTCCCCAAGGCATTCCTGGCGACGACACTGCACTGAATGGGATCAGGCTGCA 
CTGCGCGCGCGGGAACGTCCTAGGCAATACGCACGTGGTAGAGTCCCAGTCTGGAAGCTGGGGCGAATGGAGTGAGCCG 
CTGTGGTGTCGCGGCGGCGCCTACCTAGTGGCTTTCTCGCTTCGCGTGGAGGCACCCACGACCCTCGGTGACAACACAG 
CAGCGAACAACGTGCGCTTCCGCTGTTCAGACGGCGAGGAACTGCAGGGGCCTGGGCTGAGTTGGGGAGACTTTGGAGA 
CTGGAGTGACC ATTG CCC CAAGGG CGCGTG CGGCC TG CAG ACC AAG ATCCAGGG AC CTAGAGG CCTCGGCG ATGACACT 
GCGCTGAACGACGCGCGCTTATTCTGCTGCCGCAGTTG AACGGCGCCGCCGCCGCCGCTCTCTCCCGGGCCAGGAGGCT 
_ _AGTCCCACCTCTTGCTATTAAAGCTTCTCTGAGTTGA , 

The nucleic acid sequence of NOV4a maps to chromosome 17 and has 362 of 595 
bases (60%) identical to a Pseudomonas aeruginosa PA01 mRNA, section 1 17 of 529 of the 
complete genome (gb:GENBANK-ID:AE004556|acc:AE004556.!) (E = 0.00070). 
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A NOV4a polypeptide (SEQ ID NO:8) encoded by SEQ ID NO:7 is 202 amino acid 

" residues and is presented using the one letter code in Table 4B. Signal P, P sort and/or 

Hydropathy results predict that NOV4acontains a signalpeptide and is likely to be localized 

extracellularly with a certainty of 0.3700. The most likely cleavage site for a NOV4a peptide 

5 is between amino acids 24 and 25, at: TCA-QT. 



Table 4B. NOV4a protein sequence (SEQ ID NO:8) 

MERGAGAKLLPIiLLLLRATGFTCAQTGGRNGYTAVIEVTSGGPWGDWAWPEMCPDGFFASGFSLKVEPPQGlPGDDTAtiNGIR 
LHCARGNVLGNTHWESQSGSWGEWSEPLWCRGGAYLVAFSLRVEAPTTLGDNTAANNWPRCSDGEELQGPGLSWGDFGDWS 
PHC PKGACGLQTKI QGPRGLGDDTALNDARLFCCRS ■ 

The NOV4a amino acid sequence has 101 of 193 amino acid residues (52%) identical 
to,* and 122 of 193 amino acid residues (63%) similar to, a Gallus gallus 183 amino acid 
residue vitelline membrane outer layer protein I precursor protein (VMOI) 
10 ' (pto:SWISSPROT-ACC:P41366) (E l.Ge" 43 ). 

NOV4a is expressed in at least the following tissues: Bone, Brain, Cervix, Duodenum, 
Kidney, Lung and Uterus. This information was derived by determining the tissue sources of 
the sequences that were included in the invention including but not limited to SeqCalling 
sources, Public EST sources, Genomic Clone sources, Literature sources, and/or RACE 
15 sources. 

NOV4b 

A disclosed NOV4b nucleic acid of 582 nucleotides (designated CuraGen Acc. No. 
CG56375-04) encoding a novel vitelline membrane outer layer protein I precursor (VMO-I)- 
like protein is shown in Table 4C. An open reading frame was identified beginning with an 
20 ATG initiation codon at nucleotides 116-118 and ending with a TGA codon at nucleotides 
458-460. Putative untranslated regions upstream from the initiation codon and downstream 
from the termination codon are underlined in Table 4C, and the start and stop codons are in 
bold letters. 

Table 4C. NOV4b Nucleotide Sequence (SEQ ID NO:9) 

AGTGGATTAGTTTTGGGTTTCACTTAATCTGGGAGGCCCTTCCGCAATCGGAGCCCTCACAGAGGCCAAAC'reATATAA 
ATCTGCTTAGGAGGCCTGATTCACAGACGCTACAGG ATGGAGCGGGGCGCAGGAGCCAAGCTGCTGCCGCTGCTGCTGC 
TTCTGCGGGCGACTGGTTTCACATGTGCACAGGCAGATGGCCGGAACGGCTACACGGCGGTCATCGAAGTGACCAGCGG 
GGGTCCCTGGGGCGACTGGGCCTGGCCTGAGATGTGTCCCGATGGATTCTTCGCCAGCGGGTTCTCGCTCAAGGTGGAG 
CCTCCCCAAGGCATTCCTGGCGACGACACTGCACTGAATGGGATCAGGCTGCACTGCGCGCGCGGGAACGTCCTAGGCA 
ATACGCACGTGGTAGAGTCCCAGTCTGGAAGGTGGGGCGCAGGGGTCGAGGATCCCTTGGGGTG ATGTATGTCCCTTAC 
CCTCTATTACATACTCCGTCGCAACAGGCAGGCTGGCTCCGCCAGAGCTACGGGGCAGTATAGACCGGGGAGGCGAAGG 
GTGGATGGAAGACCCCTCCTCCGCTCCTC 
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The nucleic acid sequence of NOV4b maps to chromosome 17 and has 474 of 504 
bases (94%) identical to a Homo sapiens mRNA, Sequence 24 from Patent WO0065054 
(gb:GENBANK-ro:A^^ . 

A NOV4b polypeptide (SEQ ID NO: 10) encoded by SEQ ID NO:9 is 1 14 amino acid 
5 residues and is presented using the one letter code in Table 4D. Signal P, Psort and/or 

Hydropathy results predict that NOV 4b contains, a signal peptide and is likely to be localized 
extracellularly with a certainty of 0.3700. The most likely cleavage site for a NOV4b peptide 
is between amino acids 26 and 27, at: AQA-DG. 

Table 4D, NOV4b protein sequence (SEQ ID NO:10) 

' " MERGAGAKLLPbLIjLLRATGFTCAQADGRNdYTAVI EVTSGGPWGDWAWPEMCPDGFFASGFSLKVEPPQGI PGDDTALNGIR 
LHC AjRGNVLGN THWE S QSGRWGAG VED PLG 

10 ( The NOV4b amino acid sequence has 46 of 98 amino acid residues (46%) identical to, 

and 57 of 98 amino acid residues (58%) similar to, a Gallus gallus 1 83 amino acid residue 
vitelline membrane outer layer protein 1 precursor protein (VMO-I) (ptnr:SWISSPROT- 
ACC:P41366)(E = 4.4e- u ). 

NOV4b is expressed in at least the following tissues: Liver. This information was 

1 5 derived from the tissue sources of the sequences that were included in the derivation of the 
NOV4b sequence. 

Possible small nucleotide polymorphisms (SNPs) found for NOV4a are listed in Tables 
4E and 4F. 



Table 4E: SNPs 


Consensus 


Depth 


Base 


PAP 


Position 




Change 




213 


9 


OA 


N/A 


611 


8 


T>C 


N/A 



Table 4F: SNPs 


Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Change 


13375453 


93 


OT 


11 


Pro > Ser 


13373914 


138 


A > G 


26 


Thr > Ala 


13374154 


536 


T>C 


Silent 


N/A 



20 

NOV4a and NOV4b are very closely homologous as is shown in the amino acid 
alignment in Table 4G. 

Table 4G Amino Acid Alignment of NOV4a and NOV4b 

10 20 30 40 50 60 70 

. ... i .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I .... I .... 1 
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N0V4a 
W0V4Jb 



SS 

5ZI 



55 



m 



YTAVIEVTSGGPWGDWAWPEMCPDGFFASGFSLKVEPPi 
iQYTAVIEVTSGGPWGDWAWPEMCPDGFFASGFSLKVEPPQ 



100 120 120 130 140 

Ss^EWSEPLWCRGGAYIiVAFSLR2SA[3TTiTODNTAAlM 



150 



160 



170 



180 
■I- 



190 



200 



NOV4a NVRFRCSDGEELQGPGLSWGDFGDWSDHCPKGACGLQTKIQGPRGLGDDTALNDARLFCCRS 
NOV4b 



Homologies to any of the above NOV4 proteins will be shared by the other N0V4 
proteins insofar as they are homologous to each other as shown above. Any reference to 
NOV4 is assumed to refpr to both of the NOV4 proteins in general, unless otherwise noted. 

NOV4a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 4H. 



Tabl<p 4H. BLAST results for NOV4 



Gene Index/ 
Identifier 



Protein/ Organism 



Length 
(aa) 



Identity 
(%) 



Positives 
(%) 



Expect 



gi ] 17491218 | ref (XP 
058875.1 | 
(XM_058875) 



similar to 
VITELLINE MEMBRANE 
OUTER LAYER 
PROTEIN I 
PRECURSOR (VMO-I) 
(VMOI) {H. 
sapiens) [Homo 
_____ sapiens] 



109 



92/92 
(100%) 



92/92 
(100%) 



2e-44 



gi)l!74974 )sp|P4136 
6 | VMOI CHICK 



VITELLINE MEMBRANE 
OUTER LAYER 
PROTEIN I 
PRECURSOR (VMO-I) 
(VMOI) [Gallus 
gallusj 



183 



87/170 
(51%) 



106/170 
(62%) 



4S-35 



gi 1 57 63 29 )pdbllVMO | 
A 



Chain A, Vitelline 
Membrane Outer 
Layer Protein I 
[Gallus gallus] 



163 



87/170 
(51%) 



106/170 
(62%) 



7e-34 



gi|4567054 | gb | AAD23 
572 . 1 )AF123591 1 
(AF123591) 



fertilization 
envelope outer 
layer protein 
[Cyprinus carpio) 



200 



83/174 
(47%) 



104/174 
(59%) 



2e-32 



gi | 17542904 | ref |NP 
500684 . 1 f 
(NM068283) 



Y9C9A. 1 .p 
[Caenorhabditis 
elegans] 



360 



60/175 
(34%) 



85/175 
(48%) 



2e-18 



The homology of these sequences is shown graphically in the CJustaJW analysis shown 
in Table 41. 



Table 41 ClustalW Analysis of NOV4a 

1) NOV4a(SEQIDNO:8) 

2) gi)1749123 8lref]XP 058875.11 (XMJ)58875) similar to VITELLINE MEMBRANE OUTER LAYER 
PROTEIN I PRECURSOR (VMO-I) (VMOI) (H. sapiens) [Homo sapiens] (SEQ ID NO:73) 

3) pil 1J74974!splP4l366IVMOl_ CHICK VITELLINE MEMBRANE OUTER LAYER PROTEIN I 
PRECURSOR (VMO-I) (VMOI) [Gallus gallus] (SEQ ID NO:74) 

4) gi|576329|pdbllVMO(A Chain A, Vitelline Membrane Outer Layer Protein I [Gallus gallus] (SEQ ID 
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NO:75) 

5) gil456705%blAAD2 3572.3 |AF123591 1 (AF123591) fertilization envelope 6uter layerprotein [Cyprinus 
carpio] (SEQ ID NO:76) 

6) gill7542904(ref|NP 500684. 1| (NM_068283) Y9C9A.l.p [Caenorhabditis elegans] ($EQ ID NO:77) 



NOV4a 

gi | 17491218 | 
gi| 1174974 | 
gi|576329[ 
gi (4567054 j 
gi | 17542904 | 



NOV4a 

gi|l749l218| 
gi f 1174974 j 
gi | 576329 | 
gi | 4567054 | , 
gi|l7542904 | 



NOV4a 

gijl749l218| 
gi jll74974 i 
gij 576329] 
gi [4567054 | 
gi j 17542904 | 



NOV4a 

gi|l749l218| 
gij 1174974 | 
gi|576329} 
gi j 4567054 | 
gi|l7542904 | 



10 



20 



30 



40 



— | — | . . v j — | ... . | — | — |. — | — j : r) ^ 

MERGAGAK^LPLLLLLRATGPTCAQTGGRNGESTAiV^EaTSj 

--MDSSP. 

MKVLTPAALILLFFFY TVD - - ART^ESTSVl-TffiP] 

----- - - RxBEBTSvilTgPNjeft 1 

MlisLLVI TGL'QGS VEARR RRIKRSSdSyBrSE^tSpI^ e ( 

MKCGAKLNLSEFSINTEPLLTISSEMIVIglTLIiLFAI^ 




WPEMgPDgF 



IRQFMHSg - 

ir^fHhsS- 

|S^GQREMS'PAgT 
SEAR-'— gRD@E 



90 



100 



120 



110 

^SsgasHSEPPBGI PG M ^W, N [c»#rtjg HgARGNVLGN TirWEgQSGSg 



130 



140 




290 



ggQVGVSSCNPGYSLVRIGEYDNRRGAGTVTKEFQQRVSITKTSGVTTTLSNSEKYS 
300 310 320 330 



.|....|. 



340 



350 
•I 



NOV4a 

gi|!7491218j ; -- 

gi|ll74974| 

gi | 576329 | ' 

gi|4567054| 

gi | 17542904 j VSKAIEAG1SATHGMFTAKVNTALTKTSERITTSQLQRMIQDATTHERIEKITFTVPAWHRVIVEQLIIT 



N0V4a 

gijl7491218j 
gij 1174974 | 
gij 576329 | 
gi {4567054 | 
gij 17542904 j 



CGGYEVGI AKTI SRS 



The cloning and sequencing of hen magnum cDNAs encoding vitelline membrane 
outer layer protein I (VMOI) was first published on Gene 1 44 (2), 311-312 (1 994). On May 
18, 1994 this sequence version replaced gi:439481 submitted (06-Dec-1993) to DDBJ. Two 
cDNAs encoding hen vitelline membrane outer layer protein 1 (VMO-I), which is classified as 
a new type of multi-beta- sheet assembly, were cloned and sequenced. Northern blot analysis 
using vmo-I cDNA as a probe showed the presence of three mRNA species. Strikingly, 
expression of these mRNAs was restricted to a specific region of the hen oviduct, the area 
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joining the infundibulum to the magnum. The crystal structure of vitelline membrane outer 
layer protein I (VMO-I), which is isolated from the vitelline membrane outer layer of hen's 
eggs, has been determined by the multiple isomorph6tis replacement method and refined to an 
R-factor of 18.8% at 2.2 A resolution. The main chain folds into an unusual structure that 
5 consists of three beta-sheets forming Greek key motifs, which are related by an internal 
pseudo three-fold symmetry. The internal portion surrounded by these three beta-sheets is 
filled with hydrophobic side chains. This conformational feature coincides with three internal 
repeats in the sequence. This new protein fold with internal symmetry has been observed in 
two proteins: vitelline membrane outer layer protein I (VMO-I) and delta-endotoxin. Despite 

1 0 tacking any discernible sequence similarity, both proteins have similar three-dimensional ' 
structures as well as a carbohydrate-binding site in the top region of the common fold. 
1 Although a similar fold exists in the second domain of delta-endotoxin, there are significant 
structural differences between the two proteins, with the three-fold symmetry being most 
regular in VMO-I. Even though the exact function is not known, component of the outer 

1 5 membrane of the vitelline layer of the egg seems to be able to synthesize N-acetylchito- 

oligosaccharides (N=14-15) from hexaxaccharides of N-acetylglusosamine in a manner similar 
to the transferase activity of lysozyme. 

A component of the outer membrane of the vitelline layer of the egg which is able to 
synthesize n- acetyl chito-oligosaccharides (n=14-15) from hexasaccharides of n- 

20 acetylglucosamine in a manner similar to the transferase activity of lysozyme (SWISSPROT- 
ACC.-P41366). 

Lysozyme catalyzes the hydrolysis of certain mucopolysaccharides of bacterial cell 
walls. Specifically, it catalyzes the hydrolysis of the bacterial cell wall beta(l~4) glycosidic 
linkages between N-acetylmuramic acid and N-acetylglucosamine, It is found in spleen, lung, 

25 kidney, white blood cells, plasma, saliva, milk and tears. Alexander Fleming (1881-1955), of 
penicillin fame, discovered and named lysozyme. In a communication to the Royal Society, 
Fleming (Proc. Roy. Soc. Ser. B. 93: 306-317, 1922) wrote: '...I wish to draw attention to a 
substance present in the tissues and secretions of the body, which is capable of rapidly 
dissolving certain bacteria. As this substance has properties akin to those of ferments I have 

30 called it a Lysozyme....* Fleming and Allison (Brit. J. Exp. Path. 3: 252-260, 1922) 

demonstrated an unusually high concentration in cartilage, indeed the highest of any tissue. Its 
role in cartilage is unknown. It resembles lactalbumin in structure. Human lysozyme has a 
molecular mass of 14,602 Da. Neufeld (Personal Communication. Bethesda, Maryland, 1972) 
suggested that a genetic defect of lysozyme might underlie a skeletal dysplasia. Spitznagel et 
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al. (J. Clin. Invest. 51: 93 A only, 1972) observed a patient with selective deficiency of a 
particular type of neutrophil granule which resulted in about 50% reduction in lysozyme 
levels. The patient showed increased susceptibility to infection. ' ' 

Prieur et al. (Am. J. Path. 77: 283-296,1974) described inherited lysozyme deficiency 
5 in rabbits. No abnormality of cartilage or bone was noted (Greenwald et al., Biochim. 

Biophys. Acta 385: 435-437,1975). Older mutant rabbits showed increased susceptibility to 
infections, especially subcutaneous abscesses (Personal Communication. Pullman, 
Washington, 5/13/1975.). Camara et al. (Lab. Invest. 63: 544-550,1990) identified 2 isozymes 
of rabbit lysozyme and showed that their distribution was tissue specific. Leukocytic and 

10 gastrointestinal isozymes were clearly distinguished, and a possible lympho epithelial isozyme 
that resembled the gastrointestinal isozyme electrophoretically and chromatographic ally but 
not kinetically was demonstrated. Mutant, lysozyme-deficient rabbits completely lacked a 
detectable leukocytic isozyme but, had gastrointestinal and lymphoepithelial isozymes 
indistinguishable from those of normal rabbits. By electrophoretic methods, the mutant rabbits 

15 were shown to lack a protein band corresponding to that of the leukocytic isozyme in normal 
rabbits. 

Yoshimura et al. (Biochem. Biophys. Res. Commun. 150:794-801,1988) isolated a 
cDNA encoding human lysozyme from a human placenta cDNA library. The 1.5-kb cDNA 
coded for a signal peptide consisting of 1 8 amino acids and for mature lysozyme. The amino 

20 acid sequence of the mature lysozyme, deduced from the nucleotide sequence, was identical 
with the published sequence. Human lysozyme has 130 amino acid residues and 4 disulfide 
bonds (Taniyama et al., J. Biol. Chem. 266: 6456-6461, 1991). Peters et al. (Cytogenet. Cell 
Genet. 51 : 1059,1989) described the isolation of 2 overlapping genomic clones containing 25 
kb of the human lysozyme gene region. They also isolated a full-length human lysozyme 

25 cDNA clone from a human placental cDNA library. They reported on the nucleotide sequence 
of the entire structural gene and the cDNA clone. Using a panel of somatic cell hybrids, Peters 
et al (Biochemistry 38: 6419-6427,1989) assigned the lysozyme gene to human chromosome 
12. 

Canet et al. (Biochemistry 3 8(20) :64 19-27, 1999) studied the unfolding and refolding 
30 properties of human lysozyme and 2 of its amyloidogenic variants, ile56 to thr and asp67 to 
his, by stopped-flow fluorescence and hydrogen exchange pulse labeling coupled with mass 
spectrometry. Their results suggested that the amyloidogenic nature of the lysozyme variants 
arises from a decrease in the stability of the native fold relative to partially folded 
intermediates. The origin of this instability was different in the 2 variants, being caused in one 
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case primarily by a reduction in the folding rate and in the other by an increase in the 
unfolding rate. In both cases, this resulted in a low population of soluble partially folded 
species that can aggregate in a slow and controlled manner to forrn amyloid fibrils. In the 
human, mutations in the LYZ gene in renal amyloidosis represented the first link of lysozyme 
5 to genetic disease (1 53450.0001). 

The above defined information for NOV4 suggests that this NOV4 protein may 
function as a member of a vitelline membrane outer layer protein I precursor (VMO-I) protein 
family. Therefore, the NOV4 nucleic acids and proteins of the invention are useful in potential 
therapeutic and diagnostic applications. For example, a cDNA encoding the NOV4 protein 

1 0 may be useful in gene therapy, and the NOV4 protein may be useful when administered to a 
subject in need thereof By way of nonlimiting example, the compositions of the present 
1 invention will have efficacy for treatment of patients suffering from osteoporosis, 
hypercalceimia, arthritis, ankylosing spondylitis, scoliosis, neurological disorders and 
diseases, systemic lupus erythematosus, autoimmune disease, asthma, emphysem^, 

15 scleroderma, allergy, ARDS, diabetes, renal artery stenosis, interstitial nephritis, 

glomerulonepliritis, polycystic kidney disease, renal tubular acidosis, IgA nephropathy, 
hypercalceimia, Lesch-Nyhan syndrome, Von Hippel-Lindau (VHL) syndrome, cirrhosis 
and/or transplantation. The NOV4 nucleic acid encoding vitelline membrane outer layer 
protein I precursor (VMO-I)-like protein, and the vitelline membrane outer layer protein I 

20 precursor (VMO-I)-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

NOV5 

A disclosed NOV5 nucleic acid of 7676 nucleotides (also referred to as CG56089-01) 
25 encoding a novel MAST205-like protein is shown in Table 5A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 25-27 and ending with a 
TAA codon at nucleotides 7660-7662. Putative untranslated regions upstream from the 
intitation codon and downstream from the termination codon are underlined in Table 5 A, and 
the start and stop codons are in bold letters. 

Table 5A. NOV5 Nucleotide Sequence (SEQ ID NO:ll) 

gaaactggttttcacaggtgatag atggt^ggctgttttcaaaagatcaagcttgaccacatattatcccctccacccatgc 
cgtttcggaaatgcagcaacccagatgtggcttctggccctggaaaatcactgaagtataaaagacagctgagtgaggatgg 
aagacagctaaggcgagggagcctgggaggagccctgactgggaggtaccttcttccaaacccggtggcgggacaggcctgg 
ccggcctctgcagagacgtccaacctcgtgcgcatgcgcagccaggccctgggccagtcggcgccctcgctcaccgccagcc 
tgaaggagctgagtctccccagaagaggaagttttctcactcccaggagcctgagtccaaccccatccagcccaggcagtcc 
ttgtagtcctctcttggcctttcacttttggagccctgtgtgtccaaatgctgggtgccgaacaagcaaccggaaaagctta 
ataggcaatgggcagtcaccagcattgcctcgacc^cactcacctctctctgctcatgcaggaaatagccctcaagatagtc 
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CAAGAAATTTCTCCCCCAGTGCCTCAGCCCATTTTTCATTTGCACGGAGGACTGATGGACGCCGCTGGTCGTTGGCTTCTCT 
CCCTTCCTCTGGCTATGGGACAAACACACCCAGCTCTACGGTCTCTTCATCCTGTTCCTCCCAGGAGAAGTTGCATCAGTTA 
CCATACCAACCAACACCAGACGAGTTACACTTCTTATCAAAACATTTCTGTACCACCGAAAGCATCGCCACTGAGAACAGAT 
GCAGGAACACGCCGATGCGCCCCCGTTCCCGAAGTCTGAGCCCTGGACGTTCTCCCGCCTGCTGTGACCATGAAATAATTAT 
GATGMC(^TGTCTACAAAGAAAGGTTCCCAAAGGCTACAGCTCApATGGAAGAACGTCTAAAGGAAATTATCACCAGCTAC 
TCTCCTGACAACGTTCTACCCTTAGCAGATGGAGTGCTTAGTTTCACTCACCACtAGATTATTGAACTGGCTCGAGATTGgT 
'TGGATAAATCCCACCAGGGCCTCATCACCTCACGATACTTCCTTGAATTACAGCACAAATTAGATAAGTTGCTACAGGAGGC 
TCATGATCGTTCAGAAAGTGGAGAATTGGCATTTATTAAACAACTAGTTCGAAAGATCCTAATTGTTATTGCCCGCCCTGCT 
CGGTTATTAGAGTGCCTGGAATTTGATCCGGAAGAATTTTACTACCTATTGGAAGCAGCAGAAGGCCATGCCAAAGAAGGAC 
AGGGTATTAAAACCGACATTCCCAGGTACATCATTAGCCAACTGGGACTCAATAAGGATCCCTTGGAAGAAATGGCTCATTT 
GGGAAACTACGATAGTGGGACAGCAGAAACACCAGAAACAGATGAATCAGTGAGTAGCTCTAATGCCTCCCTGAAACTTCGA 
AGGAAACCTCGGGAAAGTGATTTTGAAACGATTAAATTGATTAGCAATGGAGCCTATGGGGCAGTCTACTTTGTTCGGCATA 
AAGAATCCCGGCAGAGGTTTGCCATGAAGAAGATTAATAAACAGAACCTCATCCTTCGAAACCAGATCCAGCAGGCCTTTGT 
GGAGCGGGATATCCTGACTTTTGCAGAAAACCCCTTTGTTGTCAGCATGTATTGCTCCTTTGAAACAAGGCGCCACTTGTGC 
ATGGTCATGGAATATGTGGAAGGGGGAGACTGTGCTACTTTAATGAAAAACATGGGTCCTCTCCCTGTTGATATGGCCAGAA 
TGTACTTTGCTGAGACGGTCTTGGCCTTGGAATATTTACATAATTATGGAATTGTACACAGGGATTTGAAACCAGACAGCTT 
GTTGGTTACCTCCATGGGGCACATAAAGCTGACAGATTTTGGATTATCTAAGGTGGGACTAATGAGCATGACTACCAACCTT 
TACGAGGGTCATATTGAGAAGGATGCTAGAGAGTTCCTGGATAAACAGGTCTGTGGCACACCTGAATACATTGCACCAGAAG 
TGATTCTGAGGCAGGGTTATGGAAAGCCGGTGGACTGGTGGGCCATGGGGATTATCCTCTATGAATTTCTGGTTGGATGCGT 
GCCATTCTTTGGGGATACTCCAGAGGAGCTATTTGGACAAGTCATCAGTGATGAGATCAACTGGCCTGAGAAGGATGAGGCA 
CCCCCACCTGATGCCCAGGATCTGATTACCTTACTCCTCAGGCAGAATCCCCTGGAGAGGCTGGGAACAGGTGGTGCATATG 
AAGTCAAACAGCATCGATTCTTCCGTTCTTTAGACTGGAACAGTTTGCTGAGACAGAAGGCAGAATTTATTCCCCAACTGGA 
AT CTG AGG ATG AC AC AAGTT AT TTTG ATACT CGGT CTGAG AAGTATC ATC ATATGG AAACGG AGG AAG AAG ATG A CA CAAAT 
GATGAAGACTTTAATGTGGAAATAAGGCAGTTTTCTTCATGTTCACACAGGTTTTCAAAAGTTTTCAGCAGTATAGATCGAA 
TCACTCAGAATTCAGCAGAAGAGAAGGAAGACTCTGTGGACAAAACCAAAAGCACCACCTTGCCATCCACAGAAACACTGAG 
CTGGAGTTCAGAATATTCTGAAATGCAACAGCTATCAACATCCAACTCTTCAGATACTGAAAGCAACAGACATAAACTCAGT 
TCTGGCCTACTTCCCAAACTGGCTATTTCAACAGAGGGAGAGCAAGATGAAGCTGCCTCCTGCCCTGGAGACCCCCATGAGG 
AGCC AGGAAAG CCAGCCCTTCCT CCTG AAGAGTGTGCCC AGG AGG AG CCTG AGGTC ACC AC CCCAG C C AG CACCATC AG CAG 
CTCCACCCTGTCAGTTGGCAGTTTTTCAGAGCACTTGGATCAGATAAATGGACGAAGCGAGTGTGTGGACAGTACAGATAAT 
TCCTCAAAGCCATCCAGTGAACCCGCTTCTCACATGGCTCGGCAGCGATTAGAAAGCACAGAAAAAAAGAAAATCTCGGGGA 
AAGTCACAAAGTCCCTCTCTGCCAGTGCTCTTTCCCTCATGATCCCAGGAGATATGTTTGCTGTTTCCCCTCTGGGAAGTCC 
AATGTCTCCCCATTCCCTGTCCTCGGACCCTTCTTCTTCACGAGATTCCTCTCCCAGCCGAGATTCCTCAGCAGCTTCTGCC 
AGTCCACATCAGCCGATTGTGATCCACAGTTCGGGGAAGAACTACGGCTTTACCATCCGAGCCATCCGGGTGTATGTGGGAG 
ACAGTGACATCTATACAGTGCACCATATCGTCTGGAATGTAGAAGAAGGAAG'TCCGGCATGCCAGGCAGGACTGAAGGCTGG 
AGATCTTATCACTCACATCAATGGAGAACCAGTGCATGGACTTGTCCACACAGAAGTTATAGAACTCCTACTGAAGAGTGGG 
AATAAGGTGTCAATCACTACTACCCCATTTGAAAACACATCAATCAAAACTGGACCAGCCAGGAGAAACAGCTATAAGAGCC 
GGATGGTGAGGCGGAGCAAGAAATCCAAGAAGAAAGAAAGTCTCGAAAGGAGGAGATCTCTTTTCAAAAAGCTAGCCAAGCA 
GCCTTCTCCTTTACTCCACACCAGCCGAAGTTTCTCCTGCTTGAACAGATCCCTGTCATCGGGTGAGAGCCTCCCAGGTTCC 
CCCACTCATAGCTTGTCTCCCCGGTCTCCAACACCAAGCTACCGCTCCACCCCTGACTTCCCATCTGGTACTAATTCCTCCC 
AGAGCAGCTCCCCTAGTTCTAGTGCCCCCAATTCCCCAGCAGGGTCCGGGCACATCCGGCCCAGCACTCTCCACGGTCTTGC 
ACCCAAACTCGGCGGGCAGCGGTACCGGTCCGGAAGGCGAAAGTCCGCCGGCAACATCCCACTGTCCCCGCTGGCCCGGACG 
CCCTCTCCAACCCCGCAACCCACCTCCCCGCAGCGGTCACCATCCCCTCTTCTGGGACACTCACTGGGCAATTCCAAGATCG 
CGCAAG CCTTTCC C AGCAAG ATG C ACT CCCCG C CCACC ATCG TC AG AC AC ATCGTG AGGC CCAAG AGTGCGG AGCCCCCCAG 
GTCCCCGCTGCTCAAGCGCGTGCAGTCCGAGGAGAAGCTGTCGCCCTCTTACGGCAGTGACAAGAAGCACCTGTGCTCCCGC 
AAGCACAGCCTGGAGGTGACCCAAGAGGAGGTGCAGCGGGAGCAGTCCCAGCGGGAGGCGCCGCTGCAGAGCCTGGATGAGA 
ACGTGTGCGACGTGCCGCCGCTCAGCCGCGCCCGGCCAGTGGAGCAAGGCTGCCTGAAACGCCCAGTCTCCCGGAAGGTGGG 
CCGCCAGGAGTCTGTGGACGACCTGGACCGCGACAAGCTGAAGGCCAAGGTGGTGGTGAAGAAAGCAGACGGCTTCCCAGAG 
AAACAGGAATCCCACCAGAAATCCCATGGACCCGGGAGTGATTTGGAAAACTTTGCTCTGTTTAAGCTGGAAGAGAGAGAGA 
AGAAAGTCTATCCGAAGGCTGTGGAAAGGTCAAGTACTTTTGAAAACAAAGCGTCTATGCAGGAGGCGCCACCGCTGGGCAG 
CCTGCTGAAGGATGCTCTTCACAAGCAGGCCAGCGTGCGCGCCAGCGAGGGTGCGATGTCGGATGGCCCGGTGCCTGCGGAG 
CACCGCCAGGGTGGCGGGGACTTCAGACGGGCCCCCGCTCCTGGCACCCTCCAGGATGGTCTCTGCCACTCCCTCGACAGGG 
GCATCTCTGGGAAGGGGGAAGGCACGGAGAAGTCCTCCCAGGCCAAGGAGCTTCTCCGATGTGAAAAGTTAGACAGCAAGCT 
GGCCAACATCGATTACCTCCGAAAGAAAATGTCACTTGAGGACAAAGAGGACAACCTCTGCCCTGTGCTGAAGCCCAAGATG 
ACAGCTGGCTCCCACGAATGCCTGCCAGGGAACCCAGTCCGACCCACGGGTGGGCAGCAGGAGCCCCCGCCGGCTTCTGAGA 
GCCGAGCTTTTGTCAGCAGCACCCATGCAGCTCAGATGAGTGCCGTCTCTTTTGTTCCCCTCAAGGCCTTAACAGGCCGGGT 
GGACAGTGGAACG GAG AAG CCTGGCTTGGTTGCTCCTGAGTCCCCTGTT AGG AAG AGCC CCTCCGAGTATAAGCTGGAAGGT 
AGGTCTGTCTCATGCCTGGAGCCGATCGAGGGCACTCTGGACATTGCTCTCCTGTCCGGACCTCAGGCCTCCAAGACAGAAC 
TGCCTTCCCCAGAGTCTGCACAGAGCCCCAGCCCAAGTGGTGACGTGAGGGCCTCTGTGCCACCAGTTCTCCCCAGCAGCAG 
TGGGAAAAAGAACGATACCACCAGTGCAAGAGAGCTTTCTCCTTCCAGCTTAAAGATGAATAAATCCTACCTGCTGGAGCCT 
TGGTTCCTGCCCCCCAGCCGAGGTCTCCAGAATTCACCAGCAGTTTCCCTGCCTGACCCAGAGTTCAAGAGGGACAGGAAAG 
GTCCCCATCCTACTGCCAGGAGCCCTGGAACAGTCATGGAAAGCAATCCCCAACAGAGAGAGGGCAGCTCCCCTAAACACCA 
AGACCACACCACTGACCCCAAGCTTCTGACCTGCCTGGGGCAGAACCTCCACAGCCCTGACCTGGCCAGGCCACGCTGCCCG 
CTCCCACCTGAAGCTTCCCCCTCAAGGGAGAAGCCAGGCCTGAGGGAATCGTCTGAAAGAGGCCCTCCCACAGCCAGAAGCG 
AGCGCTCTGCTGCGAGGGCTGACACATGCAGAGAGCCCTCCATGGAACTGTGCTTTCCAGAAACTGCGAAAACCAGTGACAA 
CTCCAAAAATCTCCTCTCTGTGGGAAGGACCCACCCAGATTTCTATACACAGACCCAGGCCATGGAGAAAGCATGGGCGCCG 
GGTGGGAAAACGAACCACAAAGATGGCCCAGGTGAGGCGAGGCCCCCGCCCAGAGACAACTCCTCTCTGCACTCAGCTGGAA 
TTCCCTGTGAGAAGGAGCTGGGCAAGGTGAGGCGTGGCGTGGAACCCAAGCCCGAAGCGCTTCTTGCCAGGCGGTCTCTGCA 
G CCACCTGG AATTG AG AGTG AGAAGAGTGAAAAGCTCT CCAGTTT CC CATC T TTG CAG AAAGATGGTGCCAAGG AAC CTG AA 
AGGAAGGAGCAGCCTCTACAAAGGCATCCCAGCAGCATCCCTCCGCCCCCTCTGACGGCCAAAGACCTGTCCAGCCCGGCTG 
CCAGGCAGCATTGCAGTTCCCCAAGCCACGCrTCTGGCAGAGAGCCGGGGGCCAAGCCCAGCACTGCAGAGCCCAGCTCGAG 
CCCCCAGGACCCTCCCAAGCCTGTTGCTGCGCACAGTGAAAGCAGCAGCCACAAGCCCCGGCCTGGCCCTGACCCGGGCCCT 
CCAAAGACTAAGCACCCCGACCGGTCCCTCTCCTCTCAGAAACCAAGTGTCGGGGCCACAAAGGGCAAAGAGCCTGCCACTC 
AGTCCCTCGGTGGCTCTAGCAGAGAGGGGAAGGGCCACAGTAAGAGTGGGCCGGATGTGTTTCCTGCTACCCCAGGCTCCCA 
GAACAAAGCCAGCGATGGGATTGGCCAGGGAGAAGGTGGGCCCTCTGTCCCACTGCACACTGACAGGGCTCCTCTAGACGCC 
AAGCCACAACCCACCAGTGGTGGGCGGCCCCTGGAGGTGCTGGAGAAGCCTGTGCATTTGCCAAGGCCGGGACACCCAGGGC 
CTAGTGAGCCAGCGGACCAGAAACTGTCCGCTGTTGGTGAAAAGCAAACCCTGTCTCCAAAGCACCCCAAACCATCCACTGT 
GAAAG A TTGCCCCACCCTGT G CAAACAGACAGACAACAGACAGAC A GACAAAAGCCCGAGTCAGCCGGCCGCCAACACCGA C_ 
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AGAAGGGCGGAAGGGi^GAAATGCACTGAAGCACTTTATGCTCCAGCAGAGGGCGACAAGCTCGAGGCCGGCCTTTCCTTTG 
TGCATAGCGAGAACCGGTTGAAAGGCGCGGAGCGGCCAGCCGCGGGGGTGGGGAAGGGC^TCCCTGAGGCCAGAGGGAAAGG 
GCCCGGTCCCCAGAAGCCACCGACGGAGGCAGACAAGCCCAATGGCATGAAACGGTCCCCCTCAGCCACTGGGCAGAGTTCT 
TTCCGATCC^CGGCCCTCCCGGAAAAGTCTCTGAGCTGCTCCTCCAGCTTCCCTGAAACCAGGGCCGGAGTTAGAGAGGCCT 
CTGCAGCCAGCAGCGACACCTCT'i;CTGCCAAGGCCGCCGGGGGCATGCTGGAGCTTCCAGCCCCCAGpAACAGGGACCATAG 
GAAGGCTCAGCCTGCCGGGGAGGGCCGAACCCACA1X3ACAAAGAGTGACTCCCTGCCCTCCTTCCGGGTCTCCACCCTGCCT 
CTGGAGTCACACCACCCCGACCCAAACACCATGGGCG6GGCCAGCCACCGGGACAGGGCTCTCTCGGTGACTGCCACCGTAG 
GGGAAACCAAAGGGAAGGACCCTGCCCCAGCCCAGCCTCCCCCAGCTAGGAAACAGAACGTGGGCAGAGACGTGACCAAGCC 
ATCCCCAGCCCCAAACACTGACCGCCCCATCTCTCTTTCTAATGAGAAGGACTTTGTGGTACGGCAGAGGCGGGGGAAAGAG 
AGTTTGCGTAGCAGCCCTCACAAAAAGGCCTTGTAACGGGGAGGGCCCAG 



The NOV5 nucleic acid was identified on chromosome 5 and has 1549 of 2075 bases 
(74%) identical to a Mus musculUs MASTi>05 protein kinase mRNA (gb:GENBANK- ' ; 
ID:MMU02313|acc:U02313.1) (E - 2.3e 307 ). 
5 A disclosed NOV5 polypeptide (SEQ ID NO: 12) encoded by SEQ ID NO: 1 lis 2545 

amino acid residues and is presented using the one-letter code in Table 5B. Signal P, Psort 
and/or Hydropathy results predict that NOV5 does not contain a signal peptide and is likely to 
be localized to the nucleus with a certainty of 0,9000 and to the golgi body with a certainty of 
0.9000. 



Table 5B. Encoded NOV5 protein sequence (SEQ ID NO:12) 

MEGCFQKIKLDHILSPPPMPFRKCSNPDVASGPGKs^KYKRQLSEDGRQLRRGSLGGALTGRYIiLPNPVAGQAWPASAETS 
NLVRWRSQALGQSAPSLTASLKELSLPRRGSFLTPRSIjSPTPSSPGSPCSPLLAFHFWSPVCPNAGCRTSNRKSLIGNGQS 
PALPRPHSPLSAHAGNSPQDSPRNFSPSASAHFSFARRTDGRRWSIiASLPSSGYGTNTPSSTVSSSCSSQEKLHQLPYQPT 
PDELHFLSKHFCTTESIATENRCRNTPMRPRSRSLSPGRSPACCDHEIIMMNHVYKERFPKATAQMEERLKEIITSYSPDN 
VliPLADGVLSFTHHQIIELARDCLDKSHQGLITSRYFLELQHKLDKLLQEAHDRSESGELAFIKQLVRKIIilVIARPARLL 
ECLEFDPEEFYYLLEAAEGHAKEGQGIKTDIPRYIISQLGI^KDPk^ 

PRESDFETIKHSNGAYGAVYFVRHKESRQRFAMKKINKQNLILRNQIQQAF,VERDILTFAENPFWSMYCSFETRRKLCM 
VMEYVEGGDCATLMKNMG PLPVDMARMYFAETVLALE YLHNYG I VHRDLKPDSI>LVTSMGHI KLTDFGLSKVGLMSMTTNIj 
YEGHI EKDAREFLDKQVCGTPEYI APEVI LRQGYGKPVDWWAMG 1 1 LYE FLVGCVPFFGDTPEEIiFGQVISDE INWPEKDE 
APPPDAQDLITLLLRQNPLERLGTGGAYEVKQHRFFRSLDWNSLLRQKAEFIPQLESEDDTSYFDTRSEKYHHMETEEEDD 
TNDEDFNVEIROFSSCSHRFSKVFSSIDRITQNSAEEKEDSVDKTKSTTLPSTETLSWSSEYSEMQQLSTSNSSDTESNRH 
KLSSGIjLPKLAISTEGEQDEAASCPGDPHEEPGKPALPPEECAQEEPEVTTPASTISSSTLSVGSFSEHLDQINGRSECVD 
STDNSSKPSSEPASHMARQRLESTEKKKISGKVTKSLSASALSLMIPGDMFAVSPLGSPMSPHSLSSDPSSSRDSSPSRDS 
SAASASPHQPIVIHSSGKI^GFTIRAIRVYVGDSDIYTVHHIVWNVEEGSPACQAGLKAGDIiITHINGEPVHGLVHTEVIE 
LliLKSGNKVSITTTPFENTSlKTGPARRNSYKSRMVRRSKKSKK^ 

GESLPGSPTHSLSPRSPTPSYRSTPDFPSGTN^SQSSSPSSSAPNSPAGSGHIRPSTLHGLAPKIiGGORYRSGRRKSAGNI 
PLSPLARTPSPTPQPTSPQRSPSPLLGHSLGNSKIAQAFPSKMHSPPTIVRHIVRPKSAEPPRSPLLKRVQSEEKLSPSYG 
SDKKHLCSRKHSLEVTQEEVQREQSQREAPLQSLDEUVCDVPPLSRARPVEQGCLKRPVSRKVGRQESVDDLDRDKLKAKV 
WKKADGFPEKQESHQKSHGPGSDLENFALFKLEEREKKVYPKAVERSSTFENKASMQEAPPLGSLLKDALHKQASVRASE 
GAMSDGPVPAEHRQGGGDFRRAPAPGTLQDGLCHSLDRGISGKGEGTEKSSQAKELLRCEKLDSKLANIDYLRKKMSLEDK 
EDNLCPVLKPKMTAGSHECLPGNPVRPTGGQQEPPPASESRAFVSSTHAAQMSAVSFVPLKALTGRVDSGTBKPGLVAPES 
PVRKSPSEYKLEGRSVSCLEPIEGTLDIALLSGPQASKTELPSPESAQSPSPSGDVRASVPPVLPSSSGKKNDTTSAREL.S 
PSSbKMNKSYLLEPWFLPPSRGLQNS PAVSIiPDPEFKRDRKGPHPTARSPGTVMESNPQQREGSSPKHQDHTTDPKLLTCL 
GQNLHSPDLARPRCPLPPEASPSREKPGLRESSERGPPTARSERSAARADTCREPSMELCFPETAKTSDNSKNLLSVGRTH 
PDFYTQTQAMEKAWAPGGKTNHICDGPGEARPPPRDNSSliHSAGIPCEKELGKVRRGVEPKPEALLARRSLQPPGIESEKSE 
KLSSFPSLQKDGAKEPERKEQPLQRHPSSIPPPPLTAKDLSSPAARQHCSSPSHASGREPGAKPSTAEPSSSPQDPPKPVA 
AHSESSSHKPRPGPDPGPPKTKHPDRSLSSQKPSVGATKGKEPATQSLGGSSREGKGHSKSGPDVFPATPGSQNKASDGIG 
QGEGGPSVPLHTDRAPLDAKPQPTSGGRPLEVLEKPVHLPRPGHPGPSEPADQKLSAVGEKQTLSPKHPKPSTVKDCPTLC 
KQTDNRQTDKSPSQPAANTDRRAEGKKCTEALYAPAEGDKLEAGLSFVHSENRLKGAERPAAGVGKGFPEARGKGPGPQKP 
PTEADKPNGMKRSPSATGQSSFRSTALPEKSbSCSSSFPETRAGVREASAASSDTSSAKAAGGMLELPAPSNRDHRKAQPA 
GEGRTHMTKSDSLPSFRVSTLPLESHHPDPNTMGGASHRDRALSVTATVGETKGKDPAPAQPPPARKQNVGRDVTKPSPAP 
NTDRPISLSNEKDFWRQRRGKESLRSSPHKKAL 



The NOV5 amino acid sequence has 613 of 926 amino acid residues (66%) identical 
to, and 706 of 926 amino acid residues (76%) similar to, a Mus musculus 1734 amino acid 
residue microtubule associated testis specific serine/threonine protein kinase (205-kda testis- 
specific serine/threonine protein kinase MAST205) (ptnr:SPTREMBL-ACC:Q60592) (E = 
0.0), 
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NOV5 is expressed in at least the following tissues: Adipose, Adrenal 
Gland/Suprarenal gland) Amygdala, Aorta, Artery, Bone Marrow, Brain, Bronchus, Brown 
Adipose, Cerebral MedulWCerdbral white matter, Cervix, Chorionic Villus') Colon, Coronary 
Artery, Dermis, Duodenum, Heart, Hippocampus, Hypothalamus, Kidney, Lung, Lung Pleura, 
Lymph node, Lymphoid tissue, Mammary gland/Breast, Esophagus, Ovary, Oviduct/Uterine 
Tube/Fallopian tube, Pancreas, Parotid Salivary glands, Peripheral Blood, Pituitary Gland, 
Placenta, Prostate, Respiratory Bronchiole;, Retina, Cerebellum, Skeletal Muscle, Skih^ Small 
Intestine, Spinal Chord, Spleen, Stomach, Substantia Nigra, Temporal Lobe, Testis, Thymus, 
Thyroid, Tonsils, Umbilical Vein, yrinary Bladder, Uterus, Vein and Vulva. This information 
was derived by determining the tissue sources of the sequences that were included in the 
invention including but not limited to SeqCalling sources, Public EST sources, genomic' clone 
sources, literature sources, and/or RACE sources. In addition, NOV5 is predicted to be 
expressed in testis tissue because of the expression pattern of a closely related Mus musculus 
MAST205 protein kinase mRNA homolog (GENBANK-ID: gb:(jENBANK-* 
ID:MMU02313|acc:U02313.1). 



NOV5 has homology to the amino acid sequences shown in the BLASTP data listed in 
Table 5C. 



Table 5C. BLAST results for NOV5 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi|4589590|dbi j BAA 7 
6817.1 | (AB023190) 


KIAA0973 protein 
[Homo sapiens] 


1583 


744/1341 
(55%) 


868/1341 
(64%) 


0.0 


gi| 14732014 Iref |XP 


KIAA0303 protein 
[Homo sapiens] 


2092 


' 1847/209 
2 (88%) 


1849/2092 
(88%) 


0.0 


045292. 1| 
(XM 045292) 


9i|3043646|dbj | BAA2 


KIAA0561 protein 
[Homo sapiens] 


1308 


664/1184 
(56%) 


779/1184 
(65%) 


0.0 


5487.1) (AB011133) 


gi|2224547|dbj ( BAA2 


KIAA03 03 [Homo 
sapiens] 


2137 


1893/213 
7 (88%) 


1894/2137 
(88%) 


0.0 


0762. lj (AB002301) 


gi |6678958|ref (NP 0 
32667.1 | 
(NM__008641) 


microtubule 
associated testis 

specific 
serine/ threonine 
protein kinase 
[Mus musculus] 


1734 


804/1424 
(56%) 


924/1424 
(64%) 


0.0 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
20 in Table 5D. 

Table 5D Clustal W Sequence Alignment 

1) NOV5 (SEQIDNO:12) 

2) g il458 95901dbj|BAA268mj (AB023 190) KIAA0973 protein [Homo sapiens] (SEQ ID NO:78) 
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3) ei|l'4732014jreflXP_ 045292,1 1 (XMJ)45292) KIAA0303 protein [Homo sapiens] (SEQ ID NO:79) 

4) gi)30436461dbi|BAA25487.11 (AB01 1 133) KIAA0561 protein [Homo sapiens] (SEQ ID NO:80) 

5) gi ]2224S47ldbi|BAA20762. 1 1 (AB002301) KIAA0303 [Homo sapiens] (SEQ ID NO:81) 

6) &i|6678958lref1NP 032667.11 (NMJ)08641) microtubule associated testis specific serine/threonine protein 
kinase [Mus musculus] (SEQ ID NO: 82) ' 11 1 



NOV5 

gi|4589590} 
gi|l4732014 | 
gi|3043646 [ 
gi j 2224547 | 
gij 6678958) 



NOV5 

gi | 4589590 j 
gi| 14732014 
gi |i3043646 
gi|2224547 
gij6678958 



NOV5 




4589590 | 


gi 


14732014 


gi 


3043646 


gi 


2224547 


gi 


6678958 



NOV5 

gi|4589590| 
gi |14732014 ( 
gi j3043646 | 
gij 2224547| 
gi|6678958 j 



NOV5 

gi|4589590| 
gi |14732014 
gi|3043646' 
gij 2224547 
gi j 6678958 



gi 
gi 
gi 
gi 
gi 



4589590 j 
14732014 | 
3043646 | 
2224547 j 
6678958 



N0V5 


gi 


4589590 j 


gi 


14732014 


gi 


3043646 | 


gi 


2224547 j 


gi 


6678958 j 



NOV5 



g^ 
gi 
gi 
gi 



4589590] 
14732014 | 
3043646| 
2224547 j 
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20 



30 



40 .50 60 70 
.|....|..,.|....|....|....|....|....| 
MEGCFQKIKUDHILSPPPMPFRKCSNPDVASGPGKSLKYKRQLSEDGRQIjRRGSLGGAIjTGRYLIjPNPVA 
LLPHLPPCRRRRVMS ■ 



- -,- -DESSLLRRRG ■ 

-MVTGLSPLLFRKLSiqPDlFAPTGKVKLQRQLSQDDC 



80 



90 



110 



120 



130 



140 



GQAWPASAETSNLVRMRSQAIjGQS APS LTAS liKELSLPRRGS FLTPRSLS PTKSS PGS PCS PLLAFHFWS 
DSLWTAL@N FSMPSFPGG SMFRRTK 

LQKEL@ LPRRGR 

KLRRGSLASSL^GKQLLPLSSSVHSSVGQVTWQSTGEASNLVRMRNQSLGQSAPSLTAGLKELSLPRRGS 

150 160 170 180 190 200 210 

| | | | | | - . . ... | • •[ . . . • | | | | ] . | 

PVCPNAGCRTSNRKSLIGNGQS[j^VL@RPHSPLS!&HAG 
SCRTSNRKSLlfcTS-TSPjiLPRlsHsSL^ 



GCRSGNRKSLVgGT- PSPgLSR^sBLSVPTAGgsPLjSSjaaiFSAASALNFPFA- - RRADGRRWSlgSLP 

FCRTSNRKSLIpSSTSpgbPR0HSgLHG-HTGNSPLE^^FSPNAPAHFSFVPARRTDGRRWSLgSLP 

220 230 240 250 260 270 280 

LPSsfeG&^PgST^SgcSSOEKLHQLPYQPTPDELHFLSKHFCTTESIATENRCRNTPMRPRSRSLSP 
SSGYgTN^SS^S@C@SQERLHQLPYQPTVDELHFbSKHFGSTESITDEDGGRRS0AVRPRSRSLSPG 

SSGY@Tr^SS'tfl£s^ 

SSGY^N^§S^^S@C^QE^LHQLPFQPTADEIiHFIjTKHF - STENVPDEEG - RRS^RMRPRSRSLSPG 

290 300 310 320 330 340 350 

GRSPACCDHE^ ij^ 
RSPSSYDNEiywjgNj^VY^ 

ratgtfdnei^JMS^ 
rspvsfdseiSSBnBw 




EGE^#^sR2^I|^EKi2ERI5^ 

430 440 450 460 470 




480 





520 



. i 



530 
. . 1 . . 



540 
..I .. 



iLSEGR--gsgAKKPj 

isSsnaqL^i 
peHralvgqs 

~js§SNAgL|5l- 

[EGRG 1 



550 



490 
J 



RRKP3ES 
RRKP8ES 

rrkpSes 



RQRFAMKKigKCj^ILRjHQIQQAFVERDILTFAEgPFWSMSCSFE 
RQRFAMKKlBKStilLRSQIQQAFVERDIIjTFAESPFVVgMaCSFE 

rqrfamiociSkSlilrSqiqq 

:qrfa0kkiSkqSlilrBqi qqS f verdi IiTfaeSppwsi^cs f: 
'orfamkkt[SkoSijIlr)woiooafverdiltfaeSpfvvsp^3csf: 
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gi|6678958 | 



DFETI KlilSjfiGAYGA' 



QRFAMKKIiaKQlSLILRgiQIOQAFVERDILTFAEiSPFVVSI 



570 



630 



NOV5 

gi|45S9590| 
gi 1 14732014 | 
gi |3043646| 
gi)2224547| 
gi (6678958 j 



TRRHLCMVMEYVEGGDCATL 1 
TJRRHLCMVMEYVEGGDCAT 
TRRHLCMVWEYVEGGDCAT 
rRRHI«CMVMEYVEGGDCATL' 
TRRHLCMVMEYVEGGDCAT 
RHLCMVMEYVEGGDCAT 




N0V5 

gi|4589E90| 



gi 
gi 



14732014 | 
3043646 | 
2224547| 
6678958] 



NOV5 

gi | 4589590 J 
gi j 14732014 | 
gi {3043646 j 
gi|2224547 j 
gij 6678958| 



N0V5 

gi|4589S90| 
gi|l4732014 | 
gi j3043646 | 
gi|2224547| 
gi |6678958| 



N0V5 

gi]4589590| 
gi|l4732014 ] 
gi j3043646| 
gi)2224547| 
gi|6678958 j 



NOV5 

gi 4589590j 
14732014 | 
3043646| 
2224547 



gi 
gi 
gi 




850 



I 



860 



880 



870 
: .J — 

TgNSAEE -iKjgpsEEE^KST^LPSjTET! 

JSKHE PKT P VAAAGS S KRE P S TKG pjE^KVAG^REG LGGlAjbREl 

FgNSAEE K^SjjgEfKSTSjLPSTET: 

AVQPTPTFAER SFS EDREEGWERSEj^G^RLS I - ADIRtiRE 

5j§NSA®E - 'te)sBJpraST|TLP5sTETI, 

SkbEERR TPPPTK^SbS@EKE|^SBGLAGLKGRDRg 




gi|6678958 j 



920 930 940 

20Kiju:g0EGg obpA&flc fSgg PHj 

3j^rrf^ll| psrf)sapqe|wed] 

33klai@5eg^ qd^aaIScEESp] 

JwTJSIiDMMPjKjFAFSSiEDEGVG 

_^KLAlg3EG@ QbgAA W " 

5pPMTVRHRC^LPbGPHCPE]ST£ 



950 



970 



jPG-jK 
.RLRR 

|pg-1k 
pagpkrp-vfi| 




980 

PEBc^EEPEHTTPA^I-gssgLHvfflS 
S SEJPSGS LDARAPKEEmQG EG SsgAMD 
PE§C@QEEPEBlTPAgSlBffS WL@V@S 

epSPpppaatpRmpkpBsiSap Jaalsh 

PE@c2QEEPEaTTPASilg£fs2L@ygS 

G^SRPVPERPIiERQIjKliDEEppSQ 



1000 



1020 



104p 



1050 



gi |4589590 j 
gij 14732014 | 
gi J3043646' 
gi [2224547 
gi [6678958 



NOV5 

gi)4589590| 
gi j 14732014 | 
gi |3043646| 
gi}2224547| 
gi|6678958i 



NOV5 



FgEHI^E3N^--fECVgST^esgPB@EP@SHM^Q|3lE| 

seatdrprp? dlcppsk^da^gp^atndIlvlrr^hq'qm j 

FEEHl^^^--SECv3§TgNg|2 p §S5 P S SHM SQS! lE | 

arlrsKsmgaShs^prpldagrgrrlggprdpapeK'S^ 

FBEHljfoPN fffl - - SECV(>|STdNS S^PS^EpEjSHM 
S@RCCPAljET^RGTPQI^EjBATASAI^L5vRR 
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9^ 

gi 
gi 
gi 



4589590 j 
14732014 | 
3043646 ( 
2224547| 
6678958 j 



NOV5 

gi|4589590 | 
gi|l4732014 | 
gi|3043646| 
gi|2224547 | 
gi|6678958| 



NOV5 

gi 1 4589590 | 



gi 
gi 
gi 
gi 



14732014 I 
3043646| 
2224547| 
6678958 j • 



NOVS 

gi|4589590| 
gi|l4732014 | 
gi|3043646 | 
gi| 2224547 | 
gi|6678958l 



NOV5 

gi|4589590 J 
gi J14732014 | 
gi}3043646 | 
gi|2224547 j 
gi | 6678958 [ 



gi 
gi 
gi 
gi 
gi 



4589590| 

14732014 

3043646 

2224547 

6678958 



NOV5 

gi|4589590 | 
gi[l4732014 | 
gij3043646 | 
gi|2224547 | 
gi|6678958 j 



NOV5 

gi 
gi 
gi 
gi 
gi 



4589590 j 
14732014 | 
3043646| 
2224547 j 
6678958 ] 



NOV5 

gi|4589590| 
gi|l4732014| 
gi | 3043646 | 
gi(2224547| 
gi|6678958| 




RRKSAGgttPLSP 

r^ksagSiplsp 
rrksagSiplsp; 
RRKSjBsipgsp; 

RRKSAGgJIPLSP 

Irrksagsiplsp 



RPKSAEPP 
RPKSAEPP 



1430 




1440 



1450 



1470 



;ct>y 

IpsoE- - -^pBq^B^c^vSSI^ 

K- gDGEAGRRJTg 
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|LTpjSEASPLE^GTRSggSGKG03pGK 



3HP|3F)RKBFHG- - - -f 

EfrQE@yQi 



1520 
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..|....|. 
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1610 
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1620 
..|.... 
GTLQgjGLCHSLDRGI S| 



QgGL 



1630 




1650 



.SEGA^sj^PVPA|^R^GGb/|R^SAB 
iPPVS'SKE - - KES PGGABACTP p|3aTTP - 
BjEG^st^PVPASSR^GGjDFgRftl^ 

iEVSFDEP 

SEG^MSIBg PVPAjpiR^GGDFgRSgA| 
iEWSSED^DEEP^sj^TQEPgLsgHg 

1660 




1670 1680 
.|....|....| 

jKS^OAKELbRCEKLDSKU^ID^R^Ki^LEDKEDNLCPVLKPKMTA 

jRTLgRDV ' - - -GCTR^qBvQT 

GTLQ^G JJCHSLDRG I S^K ^^^j^ KSS Q AKELLRCEKLD S KLAN I D^BrKKJ^LED KEDNLC P VLKPKMT A 
-J: -- ~----Q- 

GTLQjjjGLCHSLDRGISgKg^^ 

EASH& LLPRgs jEfl^ Epj ISHRDLKKQ'g 

1690 1700 1710 1720 1730 1740 1750 

— I — | — | — | — | — i — i — i — i - — JLi-i- 1 — l — I — l 

GSHECLPGNPVRPTGGQQEPPPASESRAFVSSTHAAQMSAVSFVPLKjAj^RiSDSGTEKPGLVAPESPVR 
E.DgTO G- 

GSHECLPGNPVRPTGGQQEPPPASESRAFVSSTKAAQMSAVSFVPLKAgjSRjjDSGTEKPGbVAPESPVR 
EE A 

GSHECLPGNPVRPTGGQQEPPPASESPJ^FVSSTHAAQMSAVSFVPIJQAgJHRjjDSGTEKPGLVAPESPVR 

-PVgSglflTG- 
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gi| 4589590 | 
gi jl4732014 
gi|3043646 
gi|2224547 
gij 6678958 
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gij 3043646 \ 
gi|2224547j 
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ijP VP P Ag LLg- - SgTfgjQ VGLTgR CgAEA VPgAGLTKi^G VgS PAPlgE-- 

TEiPPP?fiAKDli^PAAjnaHCSSPg7ffosB^ 



igPP PLjjAKDLggPAAgraHCS S PSjgAs£ 
ESsLSiQEVPDSGDRSgDI PCRAffiPLS- -gET 



TAEpgs sgQBp PKgVAAHSigs|HKPRjS3DPGP 

3llwk@qelggqqdhqdlal*t§dellkP't 



2180 



2190 



2200 



.[.... |. 



2210 
•I- 



2220 



2230 



pktkhpdrslssqkpsvgatkgkepatqslggssregkghsksgpdvfpatpgsqnkasdgigqgeggps 
pktkhpdrslssqkpsvgatkgkepatqslggssregkghsksgpdvfpatpgsqnkasdgigqgeggps 
pktkhpdrslssqkpsvgatkgkepatqslggssregkghsksgpdvfpatpgsqnkasdgigqgeggps 

2250 2260 2270 2280 2290 2300 2310 

vplhtdrapldakpqptsggrplevlekpvhlprpghpgpsepadqklsavgekqtlspkhpkpstvkdc 
vplhtdrapldakpqptsggrplevlekpvhlprpghpgpsepadqklsavgekqtlspkhpkpstvkdc 
vplhtdrapldakpqptsggrplevlekpvhlprpghpgpsepadqklsavgekqtlspkhpkpstvicdc 

2320 2330 2340 2350 2360 2370 2380 
I I 1 I I I I I I I I 1 1 I 

ptlckqtdnrqtdkspsqpaantdrraegkkctealyapaegdkleaglsfvhsenrlkgaerpaagvgk 
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gi 
gi 
gi 
gi 



14 732 014) PTLCKQTDNRQTDKSPSQPAANTDRRAEGKKCTEALYAPAEGDKIiEAGLSFVHSENRLKGAERPAAGVGK 

3043646 ' 

PTLCKQTDNRpTDKSPCQPAANTDRRAEGKKqTEALYAPAEGDKIiEAGLSFVHSENRLKGAERPAAGVGK 



2224547 
6678958 



NOV5 



gi 
gi 
gi 
gi 



45895901 
14732014 | 
3043646 | 
2224547 j 
6678958) 



' 2390 2400 2410 2420 2430 24,40 '2450 

....|....|....|....|....|....|,,..|..-.|....|....|....|....|.-..| I 

•GFPEARGKGPGPQKPPTEADKPNGMKRSPSATGQSSFRSTALPEKSLSCSSSFPETRAGVREASAASSDT 
GFPEARGKGPGPQKPPTEADKPNGMKRSPSATGQSSFRSTALPEKSLSCSSSFPETRAGVREASAASSDT 
GFPEARGKGPGPQKPPTEADKPNGMKRSPSATGQSSFRSTAIjPEKSLSCSSSFPETRAGVREASAASSDT 



N0V5 

gi | 4589590| 
gij 14732014 
gi | 3043646 
gij 2224547 
gij 6678958 



gi 
gi 
gi 
gi 



4589590 ] 
14732014] 
3043646] 
2224547 j 
6678958 j 



2460 2470 ,2480 2490 2500 2510 2520. 

SSAKAAGGMLELPAPSNRDHRKAQPAGEGRTHMTKSDSLPSFRVSTLPLESHHPDPNTMGGASHRDRAL.S 
( SSAKAAGGMLELPAPSNRDHRKAQPAGEGRTHMTKSDSLPSFRVST3jPLESHHPDPNTMGGASHRDRAL»S 
SSAKAAGGMLfeLPAPSNRDHRKAQPAGEGRTHMTKSDSLPSFRVSTLPLESHHPDPNTOGGASHRDRALS 

2530 2540 2550 2560 2570 2580 2590 

I I ! I I 1 1 I I I i ! I — 1 

■ VTATVGETKGKDPAPAQPPPARKQMVGRDVTKPSPAPNTDRPISLSNEKDFWRQRRGKESbRSSPHKKA 

VTATVGETKGKDPAPAQPPPARKQWVGRDVTKPSPAPKTDRPISLSNEKDFWRQRRGKESLRSSPHKKA 

VTATVGETKGKDPAPAQPPP'ARKQNVGRDVTKPSPAPNTDRPISLSNEKDFWRQRRGKESljRSSPHKKA 

r J n 



N0V5 

gi 
gi 
gi 
gi 
gi 



4589590| 
14732014 | 
3043646| 
2224547 j 
6678958 j 



Tables 5E - 51 list the domain description from DOMAIN analysis results against 
NOV5. This indicates that the NOV5 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 5E Domain Analysis of NOV5 



qnl 1 Smart | smart 00 22 0 , SJTKc, Serine/Threonine protein kinases, 
catalytic domain; Phosphotransferases. Serine or threonine- specif ic 
kinase subfamily. (SEQ ID NO: 83) 
Length = 256 residues, 100.0% aligned 

Score = 286 bits (731) , Expect « le-77 



NOV5 ; 


492 


00220: 


1 


NOV5 : 


552 


00220: 


60 


NOV5 : 


612 


00220 : 


120 


NOV5 : 


672 


00220: 


165 


NOV5: 


731 


00220 : 


225 



FETI Khl SNGA YGAV YFVRHKESRQRFAMKK I NKQNL I LRNQI QQAFVERDI LTFAENPF 551 

+1 +++ + IN IT I I++ + l+l I 1+ I + + + + III ++I 
YELLEVLGKGAFGKVYLARBKKTGKLVAI KVI KKEKLK- KKKRERI LRE I Kl LKKLDHPN 5 9 

WSMYCSFETRRHLCWMEYVEGGDCATLMK1SMGPLP 611 

+i +i ii i +IIU ill! i+i ii i 1 1 i + + mm* n 

I VKL YDVFEDDDKLYLVME YCEGGDLFDhLKKRGRhSEDEARFYARQJLSALE YhHSQGI 119 
VHRDLKPDSLLVTSMGHIKLTDFGLSKVGLMSMTTNLYEGHIEKDAREFLDKQVCGTPEY 671 

+HI111+++1+ i 11+11 im+i + i+ i inn 

IHRDLKPENILLDSDGHVKLADFGLAK QLDSGGTLLTTFVGTPEY 164 

IAPEVILRQGYGKPVDWWAMGIILYEFLVGCVPFFG-DTPEELFGQVISDEINWPEKDEA 73 0 

+ IMI + I + 1)1! II I++I+III) II 1 1 II II ++ + 1 + 

MAPEVLLGKGYGKAVDIWSLGVILYELLTGKPPFPGDDQLLALFKKIGKPPPPFPPPEWK 224 

PPPDAQDLITLLLRQNPLERLGTGGAYEVKQHRFF 765 

l + l+lll II ++I +11 M +MI 

ISPEAKDLIKKLLVKDPEKRL- - - TAEEALEHPFF 256 
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Table 5F Domain Analysis of NOV5 

gnl ] Pf am | p£ amO 0069 , pkinase, Protean kinase domain. (SEQ ID *NO:84) 
Lengbh = 256 residues, 100.0% aligned 

Score « 222 bits (565) , Expect = 2e-58 



NOV5 : 


492 


FETIKLISNGAYGAVYFVRHKESRQRFAMKKINKQNLILRNQIQQAFVERDILTFAENPF 


551 






+| + + +| |+| | | +| |++ + |+|+ |++| + + + III +| 




00069: 


1 


YELGEKLGSGAFGKV YKGKHKDTGE I VAI KI LKKRSLSEKKK - - RFLRE I QI LRRLSHPN 


58 


NOV5: 


552 


WSMYCS FETRRHLckvMEYVEGGDCATLMKNMGP - LPVDMARMYFAETVLALEYLHNYG 


610 






+ 1 + II II +1111+1111 ++ 1 1 1+ + + llllk 1 




00069: 


59 

i 


IVRLLGVFEEDDHLYLVMEYMEGGDLFDYLRRNGLLLSEKEAKKIALQILRGLEYLHSRG 


118 


NOV5: 


611 


I VHRDLKPDSLLVTSMGfc I KLTDFGhS KVGLMSMTTNL YEGHI EKDAREFLDKQVCGTPE 


670 






11111111+++!+ 1 +1+ IIII++ +1+11 llll 




00069: 


113 


IVHRDLKPENILLDENGTVKIADFGLARK LESSSYEKL- TTFVGTPE 


164 


NOV5: 


671 


YIAPEVILRQGYGKPVDWWAMGIILYEFLVGCVPFFGDTPEELFGQVISDEINWPEKDEA 


730 






1+IIII+ +11 II I++I + IIH 1 l+ll III +.+ 




00069: 


165 


YMAPEVLEGRGYSSKOTVWSLGVILYELLTGKLPFPGIDPLEELFRIKERPRLRLPLPPN 


224 


NOV5 : 


731 


P PPDAQDL I TLLLRQNPLERLGTGG AYEVKQHRFF 765 








+ +||| | '++[ +( | |+ ! +| 




00069: 


225 


CSEELKDLIKKCLNKDPEKRP TAKE I LNHPWF 256 







Table 5G Domain Analysis of NOV5 

gnl j Smart | smart 00219 , TyrKc, Tyrosine kinase, catalytic domain,- 
Phosphotransferases. Tyrosine- specif ic kinase subfamily. (SEQ ID 
NO: 85) 

Length = 258 residues, 94.6% aligned 
Score « 110 bits (275) , Expect - le-24 


NOV5 : 


496 


KLISNGAYGAVY FVRHKESRQRFAMKKINKONLILRNQI QQAFVERDI LTFAENPFV 


552 






| + | |+| | | |+| + + | |++ | ++ ++| + 




00219: 


5 


KKLGEGAFGEVYKGTLKGKGGVEVEVAVKTLKEDASE- -QQ I EE FLREARLMRKLDHPNI 


62 


NOV5: 


553 


VSMYCSFETRRHLCMVMEYVEGGDCATLMKNMGPLPVDM^^ 


610 






| + | +IIII+IHI ++ | + ++ + || + +||| + 




00219: 


63 


VKLLGVCTEEEPLMIVMEYMEGGDLLDYLRKNRPKELSLSDLLSFALQIARGMEYLESKN 


122 


NOV5 : 


611 


IVHRDLKPDSLLVTSMGHIKLTDFGLSKVGLMSMTTNL YEGHI EKDAREFLDKQVCGTPE 


670 






mil + ii +i+ mi++ +n+ + +i 




00219: 


123 


FVHRDLAARNCLVGENKTVKI ADFGLAR DLYDDD YYRKKKS PR 


165 


NOV5: 


671 


YI APEVILRQGYGKPVDWWAMGI ILYEFLVGC- VPFFGDTPEELFGQVI SDEINWP 


725 






++III + 11+ I++I+1 1+ 1 + 11+ + 




00219: 


166 


LPIRWMAPESLKDGKFTSKSDVWSFGVLLWEIFTLGESPYPGMSNEEVLEYLKKGYRLPQ 


225 


KOV5: 


726 


EKDEAPPPDAQDLITLLLRQNPLER 7 50 








+ 1 + 1 1+ + +| +| 




00219: 


226 


PPN--CPDEIYDLMLQCWAEDPEDR 24 8 
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Table 5H Domain Analysis of NOV5 

gnl | Smart f smart 00228 , PDZ, Domain present in PSD-95, Dig, and ZO-l/2.; 
Also called DHR (Dig homologous region), ,or GLGF ( (relatively well 
conserved tetrapeptide in these domains) . Some PDZs have been shown to 
bind O terminal polypeptides; others appear to bind internal (non^O 
terminal) polypeptides. Different PDZs possess different binding 
specificities. (SEQ ID NO: 86) 
Length = 86 residues, 89.5% aligned 

Score = 73.2 bits (178), Expect = 2e-13 



N0V5: 1064 IVIHSSGKNYGFTIRAIRVYVGDSDI YTVHHIVWNVEEGSPACQAGLKAGDLITHINGEP 1123 

+ + I II++ II I +1-1 Mil +1111 ll+l -II 

00228: 5 VEDEKGGGGLGFSL VGGKDSGDGGWVS S WPGSP AAKAGLKPGDVI LEVNGTS 58 

N0V5: 1124 VHGLVHTEVI ELLL^SGNKVS I T 1146 

I II I I ++II ++I II++I 
,00228; 59 VEGLTHLEAVDhljKEAGGKVThT 81 



Table 51 Domain Analysis of NOV5 

gnl j Pf am|pf am005 95 , PDZ, PDZ domain (Also known as DHR or GLGF) . , 
domains are found in diverse signaling proteins. (SEQ ID NO: 87) 
Length = 81 residues, 96.3% aligned 
Score = 53.5 bits (127), Expect = le-07 



PDZ 



N0V5 ; 1064 IVIHSSGKWY-GFTIRAIRVYVGDSDIYTVHHIVWNVEEGSPACOAGLKAGDLITHINGE 1122 

+ + |+ ||++ III +1 I I I + III II I 111+ 

00595: 2 VTLERQGRGGLGFSL -KGGSDKGDQGIWSEVXPGGAAERGGLKEGDRILEINGQ 55 

NOV5: 1123 P VHGL VHTE VI ELLLKSGNKVSI T 1146 

■ I + I +111 + I++I 
00595: 56 DVENVTHERAVLALKGSGGEVTLT 79 

5 The NOV5 gene encodes for a protein that is a serine/tlireonine kinase with homology 

to an intracellular kinase MAST205. MAST205 was initially identified as a kinase associated 
with spermatid microtubules and shown to be more active during spermio genesis. Recently, 
interaction of MAST205 with beta-syntrophin via their PDZ domains, and localization of these 
two proteins at the neuromuscular junction was demonstrated. It is, therefore, likely that the 

10 protein encoded by the gene of invention has similar functions. 

To identify proteins which interact with and potentially modulate the function of 
microtubules during spermatogenesis, a total testis MAP (microtubule-associated protein) 
antiserum was prepared and used it to isolate cDNA clones from a mouse testis cDNA 
expression library. Antibodies affinity purified by using one expression clone recognized a 

1 5 205-kDa protein, termed MAST205, which colocalizes with the spermatid manchette. 
Sequencing of full-length cDNA clones encoding MAST205 revealed it to be a novel 
serine/threonine kinase with a catalytic domain related to those of the A and C families. The 
testis-specific M AST205 RNA increases in abundance during prepuberal testis development, 
peaking at the spermatid stage. The microtubule-binding region of MAST205 occupies a 
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central region of the molecule including the kinase domain and sequences C terminal to this 
domain. Binding of MAST205 to microtubules requires interaction with other MAPs, since it • 
does not bind to MAP-free tubulin. A 75-kDa protein associated with immunoprecipitates of 
MAST205 from extracts of both whole testis and testis microtubules becomes phosphorylated 
5 in in vitro kinase assays. This 75-kDa substrate of the MAST205 kinase may form part of the 
M AST205 protein complex which binds microtubules. The MAST205 protein complex may 
function to link the signal transduction pathway with the organization of manchette 
microtubules (Walden and Cowan. A novel 205-kilodalton testis- specific serine/threonine 
protein kinase associated with microtubules of the spermatid manchette. Mol Cell Biol 

10 i3(12):7625-35, 1993). 

The morphological and biochemical changes that occur in the haploid male germ cell 
1 during spermiogenesis facilitate the natural delivery of the paternally imprinted chromosomes 
into oocytes. Despite the obvious morphological changes, little is known about the molecular 
events underlying spermiogenesis. A novel 205 -kDa manchette microtubule-associated 

15 serine/threonine protein kinase (MAST205) was cloned from, mouse testis. The objective of 
the study was to further delineate the role of MAST205 in mammalian spermiogenesis. While 
MAST205 RNA levels were similar in pachytene spermatocytes, round spermatids, and 
residual bodies, MAST205 protein could be detected only in round spermatids and residual 
bodies. Kinase activity associated with MAST205 immunoprecipitates was low in pachytene 

20 spermatocytes, high in round spermatids, and maximal in residual bodies, indicating that 

MAST205-associated kinase activity is modified during spermatid maturation. Furthermore, 
MAST205 protein and the associated kinase activity were not detected in epididymal 
spermatozoa, indicating that MAST205 protein is either excluded from, or degraded in, the 
latter cell type. Multiple heterologous protein species were seen in immunoprecipitates from 

25 35S-labeled mouse seminiferous tubules using an affinity-purified MAST205 antiserum. 
Consistent with this observation, MAST205 eluted as part of a 1-2 x 10(6) dalton protein 
complex when extracts of mouse testis were fractionated by Superose 6 column 
chromatography. MAST205 mRNA was detected in human testis indicative of conservation in 
other mammalian species. Taken together, these results indicate that the MAST205 complex 

30 functions in spermatid maturation in mammals (Walden and Millette, Increased activity 

associated with the MAST205 protein kinase complex during mammalian spermiogenesis. 
Biol Reprod 55(5): 1039-44, 1996). 

A screen for proteins that interact with beta 2-syntrophin led to the isolation of 
MAST205 (microtubule-associated serine/threonine kinase-205 kD) and a newly identified 
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homologue, SAST (syntrophin-associated serine/threonine kinase). Binding studies showed 
that beta 2-syntrophin aricl MAST205/SAST associated via a PDZ-PDZ domain interaction. 
MAST205 colocalized with beta 1 2-syntrophin and utrop'hin at neuromuscular junctions. SAST 
colocalized with syntrophin in cerebral vasculature, spermatic acrosomes and neuronal 
5 processes. SAST and syntrophin were highly associated with purified microtubules and 
microtubule-associated proteins, whereas utrophin and dystrophin were only partially 
associated with microtubules. Data suggest that MAST205 and SAST link the ' . 

dysti'ophin/utrophin network with microtubule filaments via the syntrophins (Lumeng et al., 
Interactions between beta 2-syntrophin and a family of microtubule-associated 

10 serine/threonine kinases. Nat Neurosci 2(7):61 1-7, 1999). 

The above defined information for NOV5 suggests that this NOV5 protein may ' 
function as a member of a MAST205 protein family. Therefore, the' NOV5 nucleic acids and 
proteins of the invention are useful in potential therapeutic applications implicated in various 
diseases and disorders described below and/or other pathologies. For example, the NOV5 

15 compositions of the present invention will have efficacy for treatment of patients suffering 
from atherosclerosis, aneurysm, hypertension, fibromuscular dysplasia, stroke, scleroderma, 
obesity, transplantation, adrenoleukodystrophy , congenital adrenal hyperplasia, Von Hippel- 
Lindau (VHL) syndrome, Alzheimer's disease,- tuberous sclerosis, hypercalcemia, Parkinson's 
disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple 

20 sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, 

pain, neurodegeneration, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
allergies, immunodeficiencies, transplantation, graft versus host disease, systemic lupus 
erythematosus, autoimmune disease, asthma, emphysema, scleroderma, ARDS, 
cardiomyopathy, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 

25 atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 
ventricular septal defect (VSD), valve diseases, scleroderma, pancreatitis, diabetes, 
hypercalcemia, ulcers, fertility, inflammatory bowel disease, diverticular disease, 
lymphedema, endocrine dysfunctions, growth and reproductive disorders, psoriasis, actinic 
keratosis, acne, hair growth/loss, allopecia, pigmentation disorders, renal artery stenosis, 

30 interstitial nephritis, glomerulonephritis, polycystic kidney disease, renal tubular acidosis, IgA 
nephropathy, muscular dystrophy, myasthenia gravis, cancer,trauma, regeneration (in vitro and 
in vivo) and/or viral/bacterial/parasitic infections. The NOV5 nucleic acid encoding the 
MAST205-like protein, and the MAST205-like-like protein of the invention, or fragments 

50 

BNSDOCID: <WO„02055704A2_I_> 



WO 02/055704 



PCT/US02/00554 



10 



15 



20 



thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

NOV6 

A disclosed NOV6 nucleic acid of 1 1 19 nucleotides (also referred to as CG56087-01) 
encoding a novel KILON-Jike protein is shown in Table 6A. An open reading frame was 
identified beginning with an ATG initiation codon at nucleotides 17-19 and ending witji a 
TAA codon at nucleotides 1079-1081. Putative untranslated regions upstream from the start 
codon and downstream from the termination codon are underlined in Table 6A, and the start 
and stop codons are in bold letters. 



Table 6A. NOV6 Nucleotide Sequence (SEQ ID NO:13) 



AGCAGCCCTAGCAGGG ATGGACATGATGCTGTTGGTGCAGGGTGCTTGTTGCTCGAACCAGTGGCTGGCGGCGGTGCTCCTC 
AGCCTGTGC TG CC TGCTACCCTC CTGC CTCCCGGCTGG AC AGAGTGTGGACTTCC CCTGGG CGG CCGTGG ACAAC ATG ATGG 
TCAGAAAAGGGGACACGGCGGATTTGAGGTGTTATTTGGAAGATGGAGCTTCAAAGGGTGCCTGGCTGAACCGGTCAAGTAT 
TATTTTTGCGGGAGGTGATAAGTGG^CAGTGGATCCTCGAGTTTCAATTTCAACATTGAATAAAAGGGACTACAGCCTCCAG 
ATACAGAATGTAGATGTGACAGATGATGGCCCATACACGTGTTCTGTTCAGACTCAACATACACCCAGAACAATGCAGGTGC 
ATCTAACTGTGCAAGTTCCTCCTAAGATATATGACATCTCAAATGATATGACCGTCAATGAAGGAACCA^CGTCACTCTTAC 
TTGTTTGGCCACTGGGAAACCAGAGCCTTCCATTTCTTGGCGACACATCTCCCCATCAGCAAAACCATTTGAAAATGGACAA 
TATTTGGACATTTATGGAATTACAAGGGACCAGGCTGGGGAATATGAATGCAGTGCGGAAAATGATGTGTCATTCCCAGATG 
TGAGGAAAGTAAAAGTTGTTGTCAACTTTGCTCCTACTATTCAGGAAATTAAATCTGGCACCGTGACCCCCGGACGCAGTGG 
CCTGATAAGATGTGAAGGTGCAGGTGTGCCGCCTCCAGCCTTTGAATGGTACAAAGGAGAGAAGAAGCTCTTCAATGGCCAA 
CAAGGAATTATTATTCAAAATTTTAGCACAAGATCCATTCTCACTGTTACCAACGTGACACAGGAGCACTTCGGCAATTATA 
CTTGTGTGGCTGCCAACAAGCTAGGCACAACCAATGCGAGCCTGCCTCTTAACCCTCCAAGTACAGCCCAGTATGGAATTAC 
CGGGAGCGCTGATGTTCTTTTCTCCTGCTGGTACCTTGTGTTGACACTGTCCTCTTTCACCAGCATATTCTACCTGAAGAAT 
GCCATTCTACAATAAATTCAAAGACCCATAAAAGGCTTTTAAGGATTCTCTGA 



The NOV6 nucleic acid was identified on chromosome 1 and has 95 1 of 1 085 bases 
(87%) identical to a Rattus norvegicus Kilon mRNA (gb: GENBANK- 
ID:AB017139|acc:AB017139.1) (E= 3.6e" 196 ). 

A disclosed NOV6 polypeptide (SEQ ID NO: 14) encoded by SEQ ID NO:13 is 354 
amino acid residues and is presented using the one-letter code in Table 6B. Signal P, Psort 
and/or Hydropathy results predict that NOV6 contains a signal peptide and is likely to be 
localized extracellularly with a certainty of 0.8200 and localized in the lysosome (lumen) with 
a certainty of 0.4990. The most likely cleavage site for a NOV6 peptide is between amino 
acids 33 and 34, at: CLP-AG. 



Table 6B. Encoded NOV6 protein sequence (SEQ ID NO:14) 



MDMMLLVQGACCSNQWLAAVLLSLCCLLPSCL.PAGQSVDFPV?AAVDNMMVRKGDTADLRCYLEDGASKGAWLNRSSIIFAG 
GDKWSVD PRVS I STLNKRDYSLQI QNVDVTDDGP YTCS VQTQHTPRTMQVHLTVQVPPKI YD I SNDMTWEGTNVTL.TCLA 
TGKPEPSISWRHISPSAKPFENGQYLDIYGITRDQAGEYECSAENDVSFPDVRKVKVWNFAPTIQEIKSGTVTPGRSGLI 
RCEGAGVPPPAFEWYKGEKKLFNGQQGIIIQNFSTRSILTV^rNVTQEHFGNYTCVAANKLGTTNASLPLNPPSTAQYGITG 
SADVLFSCWYLVLTLSSFTSIFYLKNAILQ 



The NOV6 amino acid sequence has 333 of 352 amino acid residues (94%) identical 
to, and 340 of 352 amino acid residues (96%) similar to, a Rattus noi-vegicus 348 amino acid 
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residue Kilon protein precursor (kindred of iglon) (pto:SWISSPROT-ACC:Q9Z0J8) (E - 
- 7.6e 181 ). 

NOV 6 is expressed in at least the following tissues: brain, .retina and hair follicles. This 
information was derived by determining the tissue sources of the sequences that were included 
5 in the invention including but not limited to SeqCalling sources, Public EST sources, 

Literature sources, and/or RACE sources. In addition, NOV6 is predicted to be expressed in 
brain tissues because of the expression pattern of a .closely related Rattiis norvegicus Kilon 
mRNAhomolog (GENBANK-ID: gb:GENBANK-ID:AB017139|acc:AB017139.1). 

Possible small nucleotide polymorphisms (SNPs) found for NOV6 are listed in Table 

10 tec. 



Table 6C: SNPs 


Variant 


Nucleotide 


Base 


Amino 


Base 




Position 


Change 


Acid 


Change 








Position 




13376475 


79 


C>T 


Silent 


N/A 


13376474 


330 


T>C 


105 


He > Thr 


13376473 


336 


A > G 


J 07 


Asn > Ser 


13376472 


472 


A> G 


Silent 


N/A 


13376471 


764 


G > A 


250 


Val > Met 



NOV6 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 6D. 



Table 6D. BLAST results for NOV6 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi | 11067409 | ref |n 


Kilon [Rattus 
norvegicus] 


348 


319/352 
(90%) 


326/352 
(91%) 


0 . 0 


P 067714. 1| 
(NM 021682) 


gi|5019445|emb)CA 


neurotractin-L 
[Gallus gallus] 


352 


276/351 
(78%) 


302/351 
(85%) 


e-162 


344446 . 1 | 
(AJ132999) 


gi | 5019443 |emb|CA 


neuro tract in- S 
[Gallus gallus] 


261 


169/226 
(74%) 


185/226 
(81%) 


2e-94 


B44445.1 | 
(ACT132998) 


gi | 2497324 | sp|Q62 


LIMBIC SYSTEM- 
ASSOCIATED 
MEMBRANE 
PROTEIN 
PRECURSOR 
(LSAMP) [Rattus 
norvegicus] 


338 


172/305 
(56%) 


220/305 
(71%) 


3e-91 


813 I LAMP RAT 


gi|4505025|ref |NP 


limbic system- 
associated 

membrane 
protein [Homo 

sapiens] 


338 


170/291 
(58%) 


216/291 
(73%) 


4e-90 


002329. 1 | 
(NM_002338) 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 6E. 

i ■ , r 

Table 6E Information, for the ClustalW proteins 

1) NOV6(SEQIDNO:14) 

2) gilll0674091reflNP 067714.11 (NMJ)21682) Kilon [Rattus norvegicus] (SEQ ID NO:88) 

3) gi|50 1 9445 |emblCAB44446. 1 1 (AJ132999) neurotractin-L [Gallus gallus] (SEQ IDNO:89) 

4) gflSOl 9443lemblCAB4444S. 1 1 (AJ132998) ncurotractin-S [Gallus gallus] (SEQ ID NO:90) 

5) gj|24973241sp|Q628131LAMP RAT LIMBIC SYSTEM-ASSOCIATED MEMBRANE PROTEIN 
PRECURSOR (LSAM?) [Rattus norvegicus] (SEQ ID NO:91) 1 , 

6) gil4505025lref1NP Q02329.1 1 (NMJ)02338) limbic system-associated membrane protein [Homo sapiens} 
(SEQ ID NO:92) 



NOV6 

gi J 11067409) 



gi 



5019445 
5019443 
2497324 
4505025 




JL;J 






eTjT? 




< 


% 


Si 










Ml 

21 




n 



NOV6 

gi|ll067409| 
gi j 5019445 | 
gi j 5019443 J 
gi j 2497324 j 
gi [4505025] 



NOV6 

gi|H067409| 
gij 5019445 | 
gi] 5019443] 
gi |2497324 [ 
gi |4505025 | 



NOV6 



gi 
gi 
gi 
gi 
gi 



gi 
gi 
gi 
gi 
gi 



11067409} 
5019445] 
5019443 | 
2497324 j 
4505025 



11067409] 
5019445 | 
5019443 j 
2497324 | 
4505025] 




280 
J. 



270 

KG;EKI<LFNGQMl[]ONFST]& 
I^GteRkLI SGQ^XTgKNYSTg 




350 



T|VT| EHHGfflYTCVAflffiKIiGffiT|jAS '. 

Tmrm ehSgSytcvaa BklgBtSas: 



NOV6 

gi|H067409| 
gi | 5019445 | 
gi}5019443] 
gij2497324| 
gij4505025j 
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Tables 6F and 6G list the domain description from DOMAIN analysis results against 
NOV6. This indicates that the NOV6 sequence has properties similar to those of other 
proteins known to contain these domains. ' '■ 

Table 6F Domain Analysis of NOV6 

gnl [smart | smart00409 / IG, Immunoglobulin (SEQ ID NO: 93) 
Length = 86 residues, 95.3% aligned 

Score = 61.2 bits (147), Expect =• 9e-ll . ■_ 

NOV6: 231 KSGTVTPGRSGLIRCEGAGVPPPAFEWYKGEKKLKNGQQGI I IQNFSTRSILTVTNVTQE 290 

i ii i i + ii +i iii in ii + i ii++!ii r. 

0 04 09: 2 PSVTVKEGESVTLSCEASGNPPPTVTWYKQGGKLLAESGRFSVSRSGGNSTLTISNVTPE 61 

I 

N0V6: 291 HFGNYTCVAANKLGTTNASLPL 312 

I I I I I I 1+ + + ■ I 
00409: 62 DSGTYTCAATNS SGSASSGTTL 83 

5 

Table 6G Domain Analysis of NOV6 

gnl 1 Smart 1 smart 004 08 , IGc2, Immunoglobulin C-2 Type (SEQ ID WO: 94) 
Length - 63 residues, 100.0% aligned 

Score .•= 60.5 bits (145) , Expect = 2e-10 

NOV6: 151 NEGTNVTLTCLATGKPEPS1SWRHIS PSAKPFENGQYLpi YGITRDQAGEYECSAEN 2 07 

II +1 I I I I 1 + 1 I kl + l I ++ +1 II ++ + +1 I I I I 

00408 : 1 LEGESVTLTCPASGDPVPNITWLKDGKPLPESRWASGSTLTIKNVSLEDSGLYTCVARN 60 

NOV6: 208 DVS 210 
I 

00408: 61 SVG 63 

The NOV6 gene encodes for a protein that has high homology to the IgLON adhesion 
molecules. This is a subfamily of the immunoglobulin superfamily that comprises of proteins 

10 such as LAMP, OBCAM and neurotrimin. These proteins are proteins anchored by a 
glycosyphosphatidylinositol anchor that are involved in cell adhesion and participate in 
remodeling of neurons. The NOV6 gene shows the presence of three immunoglobulin domains 
that participate in protein-protein interactions and are the hallmark of this family of proteins. 
In the central nervous system, many cell adhesion molecules are known to participate 

15 in the establishment and remodeling of the neural circuit. Some of the cell adhesion molecules 
are known to be anchored to the membrane by the glycosylphosphatidylinositol (GPI) inserted 
to their C termini, and many GPI-anchored proteins are known to be localized in a Triton- 
insoluble membrane fraction of low density or so-called "raft." In a study, the GPI-anchored 
proteins were surveyed in the Triton-insoluble low density fraction from 2- week-old rat brain 

20 by solubilization with phosphatidylinositol-specific phospholipase C. By Western blotting and 
partial peptide sequencing after the deglycosylation with peptide N-glycosidase F, the 
presence of Thy-1, F3/contactin, and T-cadherin was shown. In addition, one of the major 
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proteins, having an apparent molecular mass of 36 kDa after the peptide N-glycosidase F 
digestion, was found to be a novel protein. The result of cDNA cloning showed that the 
protein is an immunoglobulin superfamily member with three C2 domains and has six putative 
glycosylation sites. Since this protein shows high sequence similarity to IgLON family 
5 members including LAMP, OBCAM, neurotrimin, CEPU-1, AvGP50, and GP55, we termed 
the protein Kilon (a kindred of IgLON). Kilon-specific monoclonal antibodies were produced, 
and Western blotting analysis showed that expression of Kilon' is restricted to brain, and Kilon 

has an apparent molecular mass of 46 kDa in SDS-polyacrylamide gel electrophoresis in its 

i 

expressed form. In brain, the expression of Kilon is already detected in El 6 stage, and its level 

1 0 gradually increases during development. Kilon immunostaining was observed in the cerebral 
cortex and hippocampus, in which the strongly stained puncta were observed on dendrites and 
soma of pyramidal neurons (Funatsu et aL, Characterization of a novel rat brain 
glycosylphosphatidylinositol-anchored protein (Kilon), a member of the IgLON cell adhesion 
molecule family. J Biol Chem 274(l2):8224-30 9 1999). 

15 The formation of axon tracts in nervous system histogenesis is the result of selective 

axon fasciculation and specific growth cone guidance in embryonic development. One group 
of proteins implicated in neurite outgrowth, fasciculation, and guidance is the neural members 
of the Ig superfamily (IgSF). In an attempt to identify and characterize new proteins of this 
superfamily in the developing nervous system, a PCR-based strategy was used with 

20 degenerated primers that represent conserved sequences around the characteristic cysteine 

residues of Ig-like domains. Using tins approach, a novel neural IgSF member was identified, 
termed neurotractin. This GPI-Iinked cell surface glycoprotein is composed of three Ig-like 
domains and belongs to the IgLON subgroup of neural IgSF members. It is expressed in two 
isoforms with apparent molecular masses of 50 and 37 kD, termed L-form and S-form, 

25 respectively. Monoclonal antibodies were used to analyze its biochemical features and 

histological distribution. Neurotractin is restricted to subsets of developing commissural and 
longitudinal axon tracts in the chick central nervous system. Recombinant neurotractin 
promotes neurite outgrowth of telencephalic neurons and interacts with the IgSF members 
CEPU-1 (KD = 3 x 10(-8) M) and LAMP. Data suggest that neurotractin participates in the 

30 regulation of neurite outgrowth in the developing brain (Marg et al., Neurotractin, a novel 
neurite outgrowth-promoting Ig-3ike protein that interacts with CEPU-1 and LAMP. J Cell 
Biol 145(4):865-76, 1999). 

The above defined information for NOV6 suggests that NOV6 may function as a 
member of a KILON protein family. Therefore, the NOV6 nucleic acids and proteins of the 
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invention are useful in potential therapeutic applications implicated in various diseases and 
disorders described beldw and/or other pathologies: For example, the NOV6 compositions of 
the present invention will have efficacy for treatment of patients suffering from Von Hippel- 
Lindau (VHt) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, 
5 Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, 
multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, 
anxiety, pain, neurodegeneration, diabetes, psoriasis, actinic keratosis, acne, hair groWth/loss, 
allopecia, pigmentation disorders, endocrine disorders, cancer, trauma and/or 
viral/bacterial/parasitic infections. They may also be used for cell or tissue regeneration (in 
1 0 vitro or in vivo). The NOV 6 nucleic acid encoding KILON-like protein, and the KILON-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 



NOV7 

1 

A disclosed NOV7 nucleic acid of 3092 nucleotides (also referred to CG56071-01) 
15 encoding a novel mixed lineage kinase 2-like protein is shown in Table 7A. An open reading 
frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and ending 
with a TAG codon at nucleotides 3073-3075. A putative untranslated region downstream 
from the termination codon is underlined in Table 7A, and the start and stop codons are in 
bold letters. 



Table 7A. NOV7 Nucleotide Sequence (SEQ ID NO:15) 

ATGGAGCCCTCCAGAGCGCTTCTCGGCTGCCTAGCGAGCGCCGCCGCTGCCGCCCCGCCGGGGGAGGATGGAGCAGGGGC 
CGGGGCCGAGGAGGAGGAGGAGGAGGAGGAGGAGGCGGCGGCGGCGGTGGGCCCCGGGGAGCTGGGCTGCGACGCGCCGC 
TGCCCTACTGGACGGCCGTGTTCGAGTACGAGGCGGCGGGCGAGGACGAGCTGACCCTGCGGCTGGGCGACGTGGTGGAG 
GTGCTGTCCAAGGACTCGCAGGTGTCCGGCGACGAGGGCTGGTGGACCGGGCAGCTGAACCAGCGGGTGGGCATCTTCCC 
CAGCAACTACGTGACCCCGCGCAGCGCCTTCTCCAGCCGCTGCCAGCCCGGCGGCGAAATTGATTTTGCGGAGCTCACCT 
TGGAAGAGATTATTGGCATCGGGGGCTTTGGGAAGGTCTATCGTGCTTTCTGGATAGGGGATGAGGTTGCTGTGAAAGCA 
GCTCGCCACGACCCTGATGAGGACATCAGCCAGACCATAGAGAATGTTCGCCAAGAGGCCAAGCTCTTCGCCATGCTGAA 
GCACCCCAACATCATTGCCCTAAGAGGGGTATGTCTGAAGGAGCCCAACCTCTGCTTGGTCATGGAGTTTGCTCGTGGAG 
GACCTTTGAATAGAGTGTTATCTGGGAAAAGGATTCCCCCAGACATCCTGGTGAATTGGGCTGTGCAGATTGCCAGAGGG 
ATGAACTACTTACATGATGAGGCAATTGTTCCCATCATCCACCGCGACCTTAAGTCCAGCAACGTATTGATCCTCCAGAA 
GGTGGAGAATGGAGACCTGAGCAACAAGATTCTGAAGATCACTGATTTTGGCCTGGCTCGGGAATGGCACCGAACCACCA 
AGATGAGTGCGGCAGGGACGTATGCTTGGATGGCACCCGAAGTCATCCGGGCCTCCATGTTTTCCAAAGGCAGTGATGTG 
TGGAGCTATGGGGTGCTACTTTGGGAGTTGCTGACTGGTGAGGTGCCCTTTCGAGGCATTGATGGCTTAGCAGTCGCTTA 
TGGAGTGGCCATGAACAAACTCGCCCTTCCTATTCCTTCTACGTGCCCAGAACCTTTTGCCAAACTCATGGAAGACTGCT 
GGAATCCTGATCCCCACTCACGACCATCTTTCACGAATATCCTGGACCAGCTAACCACCATAGAGGAGTCTGGTTTCTTT 
GAAATGCCCAAGGACTCCTTCCACTGCCTGCAGGACAACTGGAAACACGAGATTCAGGAGATGTTTGACCAACTCAGGGC 
CAAAGAAAAGGAACTTCGCACCTGGGAGGAGGAGCTGACGCGGGCTGCACTGCAGCAGAAGAACCAGGAGGAACTGCTGC 
GGCGTCGGGAGCAGGAGCTGGCCGAGCGGGAGATTGACATCCTGGAACGGGAGCTCAACATCATCATCCACCAGCTGTGC 
CAGGAGAAGCCCCGGGTGAAGAAACGCAAGGGCAAGTTCAGGAAGAGCCGGCTGAAGCTCAAGGATGGCAACCGCATCAG 
CCTCCCTTCTGGTTTCCAGCACAAGTTCACGGTGCAGGCCTCCCCTACCATGGATAAAAGGAAGAGTCTTATCAACAGCC 
GCTCCAGTCCTCCTGCAAGCCCCACCATCATTCCTCGCCTTCGAGCCATCCAGTGTGAGACTGTTTCCCAAATTAGCTGG 
GGCCAGAACACACAGGGGCACCTGTCCGAAAGCAGCAAAACCTGGGGCAGGAGCTCAGTCGTCCCAAAGGAGGAAGGGGA 
GGAGGAGGAGAAGAGGGCCCCAAAGAAGAAGGGACGGACGTGGGGGCCAGGGACGCTTGGTCAGAAGGAGCTTGCCTCGG 
GAGATGAACTCAAGTCCCTGGTAGATGGATATAAGCAGTGGTCGTCCAGTGCCCCCAACCTGGTGAAGGGCCCAAGGAGT 
ACCCCGGCCCTGCCAGGGTTCACCAGCCTTATGGAGATGGAGGATGAGGACAGTGAAGGCCCAGGGAGTGGAGAGAGTCG 
CCTACAGCATTCACCCAGCCAGTCCTACCTCTGTATCCCATTCCCTCGTGGAGAGCCCACCCCAGTCAACTCGGCCACGA 
GTACCCCTCAGCTGACGCCAACCAACAGCCTCAAGCGGGGCGGTGCCCACCACCGCCGCTGCGAGGTGGCTCTGCTCGGC 
TGTGGGGCTGTTCTGGCAGCCACAGGCCTAGGGTTTGACTTGCTGGAAGCTGGCAAGTGCCAGCTGCTTCCCCTGGAGGA 
GCCTGAGCCACCAGCCCGGGAGGAGAAGAAAAGACGGGAGGGTC T TTTTCAGAGGTCCAGCCGTCCTCGTCGGAGCACCA 
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10 



15 



20 



GCCCCCCATCCCGAAAGCTTTTCAAGAAGGAGGAGCCCATGCTGTTGCTAGGAGACCCCTCTGCC^TCCCTGACGCTGCTC 
TCCCTCTCCTCCATCTCCGAGTGCAACTCCACACGCTCCCTGCTGCAGTCCGACAGCGATGAAATTGTCGTGTATGAGAT 
GCCAGTCAGCCCAGTCGAG<SCCCCTCCC<^GAGTCCATGTACCCACAACCCCCTGGTCAATGTCCGAGTAGAGCGCTTCA 
AACGAGATCCTAACCAATCTCTGACTCCCACCCATGTCACCCTCACCACCCCCTCGCAGCCCAGCAGTCAGCGGCGGACT 
CCTTCTGATGGGGCCCTTAAGCCAGA^ACTCTCCTAGCCAGCAGGAGCCCCAGTCCCAGCCGAGACCCAGpTGAATTCCC 
CCGTCrCCCTGACCCCAATGTGGTCTTCCCCCCAACCCCAAGGCGCTGGAACACTCAGCAGGACTCTAtCTrGGAGAGAC 



AGCCATGCCCGCAGCACCTCCCCAGCCAACAGCTCCAGCACAGAGACGCCCGGGCCGCTGCCCCCGACTGAGCGGACGCT 
CCTGGACCTGGATGCAGAGGGGCAGAGTCAGGACAGCACCGTGCCGGTGTGCAGAGCGGAACTGAACACACACAGGCCTG 
CCCCTTATGAGATCCAGCAGGAGTTCTGGTCTTAGCACGAAAAGGATTGGGG 



The disclosed NOV7 nucleic acid sequence, localized to chromosome 14q24.3, has 
i 1 1 1 ' 1 

1460 of 2133 bases (68%) identical to a Homo sapiens MST serine/threonine kinase mRN'A 

(gb:GENBAl^-lD!HSMSTMR|acc:Z48615.1) (E - 9.8e" 164 ). 

A disclosed NOV7 polypeptide (SEQ ID NO: 16) encoded by SEQ ID NO: 15 is 1024 

amino acid residues and is presented using the one-letter amino acid code in Table 7B. Signal 

P, Psort and/or Hydropathy results predict that NOV7 contains a signal peptide and is likely to 

be localized to the plasma membrane with a certainty of 0.4600. The most likely cleavage site 

for a NOV7 peptide is between- amino acids 17 and 18, at: AAA-fiP. 



Table 7B. Encoded NOV7 protein sequence (SEQ ID NO:16). 



MEPSRALLGCLASAAAAAPPGEDGAGAGAEEEEEEEEEAAAAVGPGELGCDAPLPYWTAVFEYEAAGEDELTLRIiGDVVE 
VLSKDSQVSGDEGWWTGQLNQRVGIFPSNYVTPRSAFSSRCQPGGEIDFAELTLEEIIGIGGFGKVYRAFWIGDEVAVKA 
ARHDPDEDISCTIENVRQEAKLFAMLKHPNIIALRGVCLKEPNLCLVMEFARGGPLNRVLSGKRIPPDILVNWAVQIARG 
MNyiiHDEAIVPIIHRDLKSSNVLILQKVENGDLSNKILKITDFGLAREWHRTTKMSAAGTYAWMAPEVIRASMFSKGSDV 
WSYGVLbWELLTGEVPFRGIDGIiAVAYGVAmKIiALPIPSTCPEPFAKI^EDCVmPDPHSRPSPrNILDQLTTIEESGFF 
EMPKDSFHCLQDIWKHEIQEMFDQLRAK^KELRTWEEELTRAALQQKNOEELLRRREQEIjAEREIDIIjERELNIIIHQLC 
QEKPRVKKRKGKFRKSRLKLKBGNRISLPSGFQHKFTVQASPTMDKRKSJjINSRSfePPASPTIIPRXjRAIQCETVSQISW 
GQNTQGHLSESSKTWGRSSWPKEEGEEEEKRAPKKKGRTWGPGTLGQKEIjASGDELKSLVDGYKQWSSSAPNLVKGPRS 

tpalpgftslmemededsegpgsgesrlqhspsqsylcipfprgeptpvnsatstpqltptnslkrggahhrrcevallg 
cgaviiaatglgfdllieagkcqllpleepeppareekkrregl,fqrssrprrstsppsrkbfkkeepmlllgdpsasijtll 

SLSSISECNSTRSLjLQSDSDEIWYEMPVS PVEAPPLSPCTHNPLVNVRVERFKRDPNQSIjTPTHVTIjTTPSQPSSHRRT 
PSDGALKPETLbASRSPSPSRDPGEFPRLPDPNWFPPTPRRWNTQQDSTLERPKTLEFLPRPRPSANRQRbDPWWFVSP 
SHARSTSPAMSSSTETPGPLPPTERTLLDLDAEGQSQDSTVPLCRAEIjNTHRPAPYEIQQEFWS 



The NOV7 amino acid sequence has 496 of 887 amino acid residues (55%) identical 
to, and 607 of 887 amino acid residues (68%) similar to the Homo sapiens 954 amino acid 
residue mitogen-activated protein kinase kinase kinase 10 (ec 2.7.1.-) (mixed lineage kinase 2) 
(protein kinase MST) (ptnr:SWISSPROT-ACC:Q02779) (E - L7e 229 ). 

NOV7 is expressed in at least the following tissues: testis, kidney, and whole 
organism. This information was derived by determining the tissue sources of the sequences 
that were included in the invention including but not limited to SeqCalling sources, Public 
EST sources, Literature sources, and/or RACE sources. In addition, NOV7 is predicted to be 
expressed in various tissues because of the expression pattern of a closely related Homo 
sapiens MST serine/threonine kinase mRNA homolog (gb:GENBANK- 
ID:HSMSTMR|acc:Z48615.1. 

Possible small nucleotide polymorphisms (SNPs) found forNOV7 are listed in Table 

7C. 
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Table 7C: SNPs 


Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Cbange 


13376557 


373 


OT 


Silent 


N/A 


13376556 


415 


C>T . 


Silent 


N/A 


13374149 


578 


G > A 


122 


Gly> Arg 


13376555 


588 


A > G 


125 


Glu > Gly 


13376554 


716 


OT 


168 


Pro > Ser 


13376553 


753 


A> G 


180 


Gin > Arg 


13376552 


819 


C>T 


202 


Thr>Ile 



, NOV7 has homology to the amino acid sequence shown in the BLASTP data listed in 

Table 7D. 



Table 7D. BLAST results for NOV7 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


Cji| 12005724 | gb j AAG4 4 


mixed lineage 
kinase MLK1 [Homo 
sapiens] 


1066 


834/1080 
(77%) 


837/1080 
(77%) 


0.0 


591 .1 | AF251442 1 
(AF251442) 


qi| 14 749517 jref jXP 0 


mitogen- activated 
protein kinase 
kinase kinase 9 
[Homo sapiens] 


922 


.702/936 
(75%) 


705/936 
(75%) 


0 . 0 


27237. lj (XM 027237) 


gi|6686295|sp|Q02779 


MI TOGEN- ACTIVATED 

PROTEIN KINASE 
KINASE KINASE 10 
(MIXED LINEAGE 

KINASE 2) 
(PROTEIN KINASE 
MST) [Homo 
sapiens] 


954 


506/965 
(52%) 


607/965 
1 (62%) 


0.0 


| M3KA HUMAN 


gi | 4505263 JreffNP 00 


mitogen- activated 

protein kinase 
kinase kinase 10; 

mixed lineage 
kinase 2 (tyr and 
ser/thr 
specificity) ; 
MKN28 kinase 
[Homo sapiens] 


953 


504/964 
(52%) 


606/964 
(62%) 


0.0 


2437. 1| (NM 002446) 


gi|462606|sp|P80192| 


MI TOGEN - ACT I VATED 
PROTEIN KINASE 
KINASE KINASE 9 
(MIXED LINEAGE 
KINASE 1) [Homo 
sapiens] 


394 


346/369 
(93%) 


347/369 
(93%) 


0.0 


M3K9 HUMAN 



5 The homology of these sequences is shown graphically in the ClustalW analysis shown 

in Table 7E. 

Table 7E. Information for the ClustalW proteins 

1) NOV7 (SEQEDNO:l6) 

2) gi|120057241gblAAG4459UIAF251442_l (AF25 1442) mixed lineage kinase MLK1 [Homo sapiens] (SEQ 
ID NO:95) 
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3) gi[l 47495 nireflXP 027237.lt (XMJ)27237) mitogen-activated protein kinase kinase kinase 9 [Homo 
sapiens] (SEQ ID NO:96) 

4) gi|66862951 s pIO02779lM3KA HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE • 
1 0 (MIXED LINEAGE KINASE 2) (PROTEIN KINASE MST) [Homo sapiens] (SEQ ID NO:97) 

5) gi[4505263)reflNP 002437.1) (NMJ)02446) mitogen-activated protein kinase kinase kinase 10; mixed ■ 
lineage kinase 2 (tyr and ser/thr specificity); MKN28 kinase [Homo sapiens] (SEQ.ID NO:98) 

6) gil462606[splP80192tM3K9__HUMAN MITOGEN-ACTIVATED PROTEIN KINASE KINASE KINASE 9 
(MIXED LINEAGE KINASE 1) [Homo sapiens] (SEQ ID NO:99) 



gi 
gi 
gi 
gi 



12005724 I 
14749517| 
6686295 | 
4505263 I 
462606 | 



10 20 ,30 40 50 

| | | | j | j | | |.. 

MEPSPALLGCIiASAAAAAPPGEDGAGAGAEEEEEEEEEAAAAVGPGELGCDAP: 

' p] 




- ME E EEGA VA KEWGTT PAG 
- ME EE EGAVA KEWGTT PAG 



NOV7 

gi | 12005724 | 
gi|l4749517| 
gi | 6686295 | 
gi|4505263 j 
gi}462606| 



NOV7 

gi|l2005724| 
gi jl4749517| 
gi|6686295| 
gi [4505263 j 
gi|462606| 



NOV7 



gi 
gi 

gi 
gi 



12005724 j 
14749517 | 
6686295 | 
4505263( 
462606| 



NOV7 

gijl2005724| 
gi|l4749517| 
gi (6686295 | 
gi|4505263| 
gi |462606 [ 



NOV7 

gi J12005724 | 
gi Jl4749517| 
gi |6686295 | 
gi 4505263 j 
gi |462606| 



NOV7 

gi{l2005724j 
gijl4749517| 
gi j6686295) 
gi [4505263 j 
gi |462606| 



NOV7 




120 



Jr§af^srcqpggh 

rgRSAF^'sRCQPGG^DPS CYPPIQK 



RVGffiFPSgYVgPj 
RVgSfPsSyvSp; 



S^PAaPAGLQLPg- 
3APAffl|PAGLQIjPp- 



170 



180 



210 




230 



clkepglclvmefarggp; 
clkepSlclvmefarggp; 



gRVLSGKRXPPDIIi 

Srvlsgkrippdii/ 



250 
. . j .. 



I 



260 



270 



CLKEPmLCLVMEFARGG PI^RVLSGKRXPPDILVffiWAVQ lARGM^ 
CLKEP^CIiViyrEFAJlGGPLjSRVLSGKRIPPDILvSmVQIARGW 
ClfflPHLCLVMERARGGSliSRVlI^ 



AVQIARGMgYLHDE AIVPI IHRDLKSSfijflLILQI 
jWAVQIARGMMYLHDEAIVPI rHRDLKSsSlIiII.QK 

'wavqiargmSylhdeaivpi ihrdlkssSililq 

WAVoSaRGmSyLH^aBvPI IHRDLKS FglLXL& 

wav^JargmSylh^aqvp i r HRDLKSgrar Lr l 

iWAVQIARGMtSyLHDEAI VPI IWRDhKSsSllilJjQK 



290 



300 



310 



320 



330 



340 



350 



VeSgDLsSkI LKITOFGLAREWtiRTTKMS AAGTYAWMAPE VI F| 
VBlGDLs BjaL KITDFGIJUraWHRTTK^ 
EgteSL^D^BLKXTDFGLAREWl^TTKMSAAGTYAWI^ 

^s^ls^lkitdfgiarew^^ 



■ I . 



360 



I 



3 70 
..I .. 



I ■ 



380 



390 
..I .. 



400 



410 



420 



RGIDGlAVAYGVAM^KIiALPIPSTCPEPFAKLMEDC^ 
RGIDGIAVAYGVAMwKLAXPIPSTCPEPFAKLMED^^ 
RGIDGIJWAYGVAr^KIaALPIPSTCPEPFAKL^ 
RglDSLAVAYGVAI^Kl^LPIPST^^ 
RgIDgJ^VAYGVAMgKl||LPIPSTCPEPF^ 

RG I DGlSVAYG VAMfflfCIALP rPSTCPEPFAKLMEnJCT^PDPHSRPSFTajriiDQLTTIEESGFFEMPKDS Fi 




470 



480 



490 



S33 




jaw 


* 






- 




m 


• '5 : 
♦ : 






m 





QLGQEKPRVKKRKGKF 



510 
I- 



530 



59 



540 



VQASPTffiDKR 
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gi|l200S724 | 
gi[l4749517| 
gi)6686295] 
gi|4505263 | 
gij462606| 



NOV7 



91 

gi 
gi 
gi 
gi 



12005724 J 
14749517 j 
6686295 | 
4505263 j 
462606 j 



NOV7 

gi|l2005724| 
gijl4749517j 
gi j 6686295 j 
gi|4505263 j 
gij462606| , 



WOV7 

gi j 12005724 1 
gijl47495l7 j 
gij 6686295| 
gi|4505263 j 
gi [462606 | 



N0V7 

gi|l2005724j 
gi|l4749517 | 
gij 6686295 j 
gi|4505263 j 
gi{462606 I 



N0V7 

gijl2005724 | 
gijl4749517| 
gi(6686295 j 
gi|4505263| 
gi|462606 | 



N0V7 

gi|l2005724 | 
gi|l4749517 j 
gij6686295 | 
gi|4505263 j 
gi|462606 | 



NOV 7 

gi {12005724 | 
gi |14749517| 
gi|6686295 | 
gi|4505263 j 
gi |462606| 



N0V7 

gi|l2005724| 
gi |14749517 j 
gij 6686295 j 
gi|4505263 j 
gi|462606 | 



SPPASPgilP 

sppaspHiip 
sppaspHiip 
sppaspHiip 




93 0 



940 




- """iLTTPSQPSi 

(LTTPSQPSJ 
LTTPSQPSS 
k VCAVSRG 



950 
,..|... 



HRR1 
HRR"] 


:psdg 
:psdg 


2 




PPSDG 
CPSDG 


5 

1 


233 







960 
, . I . . 



-QR- -GPPE 



-GPPEgAGHg ~ ~ 




1000 



1010 




1020 



SSwNT^QDSTLEp 

SgRgp^FPG EEli 

SRSPfeFPG- 





1080 
..|... 



1120 



(SNLDSCFAS S S STVEERPGLPALLPFQA 
SNLDSCFASSSSTVEERPGLPALLPFQA 

ESP 

ESP 
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NOV7 

gi [12005724 | 
gi}l4749517) 
gij66862S5| 
gi|4505263| 
gi (462606) 



1130 

J....I....I.... 

gELfTORPAPYElQQEFWS 

Bbli™rpapyeiqqefws 

SeIjNIiSRPAPYEIQQEFWS 

gHG-&ffl- 

0HG-is3 



Tables 7F - 7J list the domain description from DOMAIN analysis results against 
NOV7. This indicates that the N0V7 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 7F Domain Analysis of NOV7 



qnl,| Smart | smart00219 , TyrKc, Tyrosine kinase, catalytic domain; 
Phosphotransferases. Tyrosine- specif ic kinase subfamily. (SEQ ID 
NO:100) 

Length = 258 residues, 100.0% aligned 

Score = 260 bits (665) , Expect = 2e - 70 



NOV7: 


132 


00219 i 


1 


NOV7: 


187 


00219: 


58 


NOV7: 


244 


00219: 


118 


NOV 7 : 


301 


00219: 


167 


NOV7 : 


359 


00219 : 


226 



LTLEEIIGIGGFGKVYRAFWIGD EVAVKAARHDPDED I SQT I ENVRQE A KkFAML 186 

III + +1 I 11 + 11+ t Illll +1 I .III +M + I I 

LTLG KKLGEG AF GE V YKGTL KGKGGVE V E V AV KTLKEDAS E QQIEEFLREARLMRKL 5 7 

KHPNIIALRGVCLKEPNLCLVMEFARGGPLNRVL SGKRIPPDILVNWAVQIARGMNY 243 

1111+ I III +1 I +111+ til I |'+ I+ + +I+IIIIII I 

DHPNIVKLLGVCTEEEPLMIVMEYMEGGDLLDYLRKNRPKELSLSDLLSFAIiQIARGMEY 117 

LHDEAIVPIIHRDLKSSNVLILQKVENGDLSNKILKITDFGLAREWHRTTKMSAAGT--- 300 
I +1111 + I 1+ II +11 IIIIII+ + 

LES - - - KNFVHRDLAARNCLV GENKTVKIADFGLARDLYDDDYYRKKKSPRL 166 

- YAWMAPEVIRASMFSKGSDWSYGVLLWELLT-GEVPFRGIDGLAVAYGVAMNKLALPI 3 58 

Illll ++ 1+ 11111+111111+ I II 1+1+ I + II 

PIRWMAPESLKDGKFTSKSDVWSFGVIiriWErFTLGESPYPGMSNEEVI.EYL-KKGyRrjPQ 225 

PSTCPEPFAKLMEDCWNPDPHSRPSFTNILDQL 3 91 

I II- II II If M + I+++++I 

PPNCPDEIYDLMLQCWAEDPEDRPTFSELVERL 2 58 



Table 7G Domain Analysis of NOV7 

gnl 1 Pfamlpfam00069 , pkinase, Protein kinase domain. (SEQ ID NO:101) 
•Length = 256 residues, 98.0% aligned 

Score = 255 bits ( 652) , Expect = 8e-69 



NOV7: 


132 


00069 : 


1 


NOV7 : 


190 


00069 : 


58 


NOV7 : 


248 


00069 : 


118 


NOV7 : 


305 


00069 : 


167 


NOV7 : 


364 



LTLEEI IGIGGFGKVYRAFWI - - GDEVAVKAARHDPDEDI SQTIENVRQEAKLFAMLKHP 189 

I 1 +1 I 11111+ 1+ ll+l + -1+ + +1 ++ I II 

YELGEKLGSGAFGKVYKGKHKDTGEI VAI KI LK KRSLSEKKKRFLREIQI LRRLSHP 5 7 

NIIALRGVCLKEPNLCLVMEFARGGPLNRVLSGKR- - I PPDILVNWAVQIARGMNYLHDE 24 7 

11+ Ml ++ +1 11(1+ II I I + l + ll 11+ III 

NI VRLLGVFEEDDHLYLVMEYMEGGDLFDYLRRNGLLLSEKEAKKI ALQILRGLEYLHSR 117 

AIVP1IHRDLKSSNVLILQKVENGDLSNKILKITDFGLAREWHR TTKMSAAGTYAWM 3 04 

II Illll 1+1+ III + 11111111+ + II +1 

GIV HRDLKPENILL DENGTV KIADFGLARKLESSSYEKLTTFVGTPEYM 166 

APEVI RASMFSKGSDVWS YGVLLWELLTGEVP FRGI DGL - AVAYG VAMN KLAL P I PSTCP 363 

UII+ +i mi i i+i+i i m i++i i in i + +i ii+i i 

APEVLEGRGYSSKVDVWSLGVILYELLTGKLPFPGIDPLEELFRIKERPRLRLPLPPNCS 22 6 
EPFAKLMEDCWNPDPHSRPSFTNIL 3 88 
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00069; 227 



I I++ I I II 11+ II 

EELKDLIKKCLNKDPEKRPTAKEIL 251 



Table 7H Domain Analysis of NOV7 

gnl ] Smart | smart0Q220 , SJTKc, Serine/Threonine' protein kinases , 
catalytic domain; Phosphotransferases. Serine or threonine- specif ic 
kinase subfamily. (SEQ ID NO: 102) 
Length = 256 residues, 97.7% aligned 

Score = 221 bits (562) , Expect g 2e-58 ' ■ 



WOV7 : 


133 


00220 : 


2 


N0V7: 


191 


00220: 


60, 


NOV7 : 


250 


00220 : 


119 


NOV7 : 


308 


00220 : 


169 


NOV7 : 


366 


00220 : 


229 



TLEE1 IGIGGFGKVY I RAFWI- -GDEVAVKAARHDPDEDISQTIEMVRQEAKLPAMLKHPN 190 

I M I llllfl MM ++ + I + +11+ Mi 

ELLEVLGKGAFGKV YLARDKKTGKLVAI K- - VI KKEKLKKKKRERILREI KI LKKLDHPN 5 9 
I IALRGVCLKEPNLCLVMEFARGGPLNRVL- SGKRI PPDILVNWAVQIARGMNYLHDEAI 24 9 

1+ M + I 1111+ II I +1 1+ I +1 II + III + 

I VKL YDV FEDDDKL YL VME YCEGGDLFDLLKKRGRLSEDEARF YARQI LSALE YLHSQG - 118 
VPI IHRDLKSSNVLIIiQKVENGDLSNKILKITDFGLAREWHRTTKM- - SAAGTYAWMAPE 30 7 

mini i+. +i+ IMII++ +1+M +mi 

- - IIHRDLKPENILL D SDGHVKJjAD FGL AKQLD SGGTLLTTF VGTPE YMAP E 16 8 

VIRASMFSKGSDVWSYGVLLWELLTGEVPFRGIDGLAVAYGVAMNKLALPIP- -STCPEP 3 65 

1+ + ! l + n II + I + IIIII+ MM) + I , 

VLLGKGYGKAVDIWSLGVILYELLTGKPPFPGDDQLLALFKKIGKPPPPFPPPEWKISPE 22 8 

FAKLMEDCWNPDPHSRPSFTNIL ■ 388 

I++ 111+ I 

AKDLI KKLLVKDPEKRLTAEEAL 251 



Table 71 Domain Analysis of NOV7 

gnl | Smart | smart00326 , SH3 , Src homology 3 domains; Src homology 3 
(SH3) domains bind to target proteins through sequences containing 
proline and hydrophobic amino acids. Pro-containing polypeptides may 
bind to SH3 domains in 2 different binding orientations. (SEQ ID 
NO: 103) 

Length = 5 9 residues, 96.6% aligned 

Score = 72.0 bits (175), Expect = 2e-13 



NOV? : 
00326 

N0V7: 112 
00326: 56 



5 3 PLP YWTAVFE YEAAGEDELTLRLGDVVEVLSKDSQ VSGDEGWWTGQL - NQRVG I FPSN YV 111 

I I+++I I 111+ + II++ II I Mil l+l + l+llllll 

1 EGPQVRALYDYTAQDPDELSFKKGDI ITVLEKS DDGWWKGRLGTGKEGLFPSNYV 55 

TP 113 

EE 57 



Table 7J Domain Analysis of NOV7 

gnl 1 Pf am] pf am00018 , SH3 , SH3 domain. SH3 (Src homology 3) domains are 
often indicative of a protein involved in signal transduction related 
to cytoskeletal organization. First described in the Src cytoplasmic 
tyrosine kinase. The structure is a partly opened beta barrel. (SEQ ID 
NO: 104) 

Length = 57 residues, 98.2% aligned 

Score t= 65.1 bits (157), Expect = 2e-ll 



NOV7: 55 PYWTAVFEYEAAGEDELTLRLGDWEVLSKDSQVSGDEGWWTGQL-NQRVGIFPSNYVTP 113 

I + ||++ III I I III l + l + 1+ 1 1 1 1 1 I 

00018: 1 PKWALYDYQARESDELEFKKGDIIIVLEK SDDGGWWKGRLKGTKEGLI PSNYVEP 56 
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Eukaryotic protein kinases make up a large superfamily of proteins which share a 
conserved catalytic core common with both serine/threonine and tyrosine protein kinases. 
Kinases play a central role in gehe regulation and signal transduction pathways that regulate 
cell proliferation, differentiation and survival (Hunter and Hanks Protein kinases 6. The 
5 eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification. 
FASEB J 9:576-96, 1995). SH3 (src Homology-3) domains are found in a great variety of 
intracellular or membrane-associated protems and are 3mall protein modules containing - 
approximately 50 amino acid residues. The SH3 domain may mediate assembly of specific 
protein complexes and are thought ,to be involved in linking signals transmitted from the cell 

1 0 surface by protein tyrosine kinases to "downstream effector proteins" (Baltimore and Mayer 
Signalling through SH2 and SH3 domains. Trends Cell Biol. 3: 8-13, 1993). The SH3 domain 
also regulates aspects of cell proliferation, differentiation and survival. 

Two members of a novel protein kinase family, the mixed-lineage kinases (MLK) have 
been identified based on their expression in human epithelial tumor cells. The'human mixed- 

1 5 lineage kinase 2 (MLK2) (also called MST kinase) protein localizes to human chromosome 
19ql3.2. This gene was expressed in a gastric cancer cell line and epithelial tumor cells. The 
MST gene encodes a novel putative non-receptor type of serine/threonine kinase with Src 
homology 3 (SH3) domain, two leucine zipper domains and proline rich dornain. The role of 
this gene in cancer has not yet been clearly defined but its expression suggests an important 

20 role in disease processes. The novel gene described in this invention is similar to MLK2, 

localizes to chromosome 14 and may play an important role in autoimmune, metabolic and 
neurological diseases as well as cancer (Dorow et al., Complete nucleotide sequence, 
expression, and chromosomal localisation of human mixed-lineage kinase 2. Eur J Biochem , 
234: 492-500, 1995; Katoh et al., Cloning and characterization of MST, a novel (putative) 

25 serine/threonine kinase with SH3 domain. Oncogene 10: 1447-51, 1995). 

The above defined information for NOV7 suggests that this NOV7 protein may 
function as a member of a mixed lineage kinase 2- protein family. Therefore, the NOV7 
nucleic acids and proteins of the invention are useful in potential therapeutic applications 
implicated in various diseases and disorders described below and/or other pathologies. For 

30 example, the NOV7 compositions of the present invention will have efficacy for treatment of 
patients suffering from diabetes, autoimmune disease, renal artery stenosis, interstitial 
nephritis, glomerulonephritis, polycystic kidney disease, systemic lupus erythematosus, renal 
tubular acidosis, IgA nephropathy, hypercalceimia, Lesch-Nyhan syndrome, fertility, cancer, 
trauma, regeneration (in vivo and in vitro), bacterial and/or viral infections. The NOV7 nucleic 
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acid encoding mixed lineage kinase 2-like protein, and the mixed lineage kinase 2-like protein 
of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

NOV8 

A disclosed NOV8 nucleic acid of 834 nucleotides (also referred to CG56042-01) 
encoding a novel S-l -like protein is shown in Table 8 A. An open reading frame was identified 
beginning with an ATG initiation codon at nucleotides 215-217 and ending with a TGA codon 
at nucleotides 827-829. Putitive untranslated regions upstream from the initiation codon and 
downstream from the termination codon are underlined in Table 8A } and the start and stop 1 
codons are in bold letters. 

Table 8A. NOV8 Nucleotide Sequence (SEQ ID NO:17) 

ATGACATGGCGGGCAGTGGTCAGGATA C TGACCTGGGCTGCCCTACAGGTGTTGGTAAGTTGGTTATCATGCCCTCTCTC 
ACCCCTACCACAGCTCCCCATGCCGCAATGGCG G GCAGTGCCAGGACGACCAGGGCTTTGCTCTCAACTTCACGTGCCGC 
TGCTTGGTGGGCTTTGTGGGTGCCCGCTGTGAGGTAAATGTGGATGACTGCCTG ATGCGGCCTTGTGCTAACGGTGCCAC 
CTGCCTTGACGGCATAAACCGCTTCTCCTGCCTCTGTCCTGAGGGCTTTGCTGGACGCTTCTGCACCATCAACCTGGATG 
ACTGTGCCAGCCGCCCATGCCAGAGAGGGGCCCGCTGTCGGGACCGTGTCCACGA | CTTCGACTGCCTCTGCCCCAGTGGC 
TATGGTGGCAAGACCTGTGAGCTTGTCTTACCTGTCCCAGACCCCCCAACCACAGTGGACACCCCTCTAGGGCCCACCTC 
AGCTGTAGTGGTACCTGCCACGGGGCCAGCCCCCCACAGCGCAGGGGCTGGTCTGCTGCGGATCTCAGTGAAGGAGGTGG 
TGCGGAGGCAAGAGGCTGGGCTAGGTGAGCCTAGCTTGGTGGCCCTGGTGGTGTTTGGGGCCCTCACTGCTGCCCTGGTT 
CTGGCTACTGTGTTGCTGACCCTGAGGGCCTGGCGCCGGGGTGTCTGCCCCCCTGGACCCTGTTGCTACCCTGCCCCACA 
CTATGCTCCAGCGTGCCAGGACCAGGAGTGTCAGGTTAGCATGCTGCCAGCAGGGCTCCCCCTGCCACGTGACTTGCCGC 
CTGAGCCTGGAAAGACCACAGCACTGTGATGGAG 

The disclosed NOV8 nucleic acid sequence has 213 of 261 bases (81%) identical to a 
Mus musculus S-l mRNA (gb:GENBANK-ID;AB01 1 01 9|acc:AB01 1019.1) (E - 2.3e" 32 ). 

A disclosed NOV8 polypeptide (SEQ ID NO: 18) encoded by SEQ ID NO: 17 is 204 
15 amino acid residues and is presented using the one-letter amino acid code in Table 8B. Signal 
P, Psort and/or Hydropathy results predict that NOV8 does not contain a signal peptide and is 
likely to be localized at the mitochondrial inner membrane with a certainty of 0.8218 and to 
the plasma membrane with a certainty of 0.7000. 

Table 8B. Encoded NOV8 protein sequence (SEQ ID NO:18). 

MRPCANGATCLDGINRPSCLCPEGFAGRFCTINLDDCASRPCQRGARCRDRVHDFDCLCPSGYGGKTCELVLPVPDPPTT 
VDTPLGPTSAVWPATGPAPHSAGAGLLRISVKEWRROEAGLGEPSLVALWFGALTAALVLATVljLTliRAWRRGVCPP 
GPCCYPAPHYAPACQDQECQVSMbPAGLPLPRDLPPEPGK T TAL 

20 The NOV8 amino acid sequence has 71 of 84 amino acid residues (84%) identical to, 

and 73 of 84 amino acid residues (86%) similar to, a Mus musculus 102 amino acid residue S- 

1 protein (ptnr:SPTREMBL-ACC:Q9QYP3) (E = 6.4e 33 ). 

NOV8 is expressed in at least the following tissues: Adrenal Gland/Suprarenal gland, 

Brain, Kidney, Liver, Lung, Mammary gland/Breast, Pancreas, Parietal Lobe, Pituitary Gland, 

25 Prostate, Synovium/Synovial membrane, Testis, Uterus, and Whole Organism. This 
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information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. ' 

Possible small nucleotide polymorphisms (SNPs)'found for NOV8 is listed in Table 

5 8C. 



Table SC.: SNPs 


Consensus 
Position 


Depth 


Base 
Change 


PAF 


384 


54 


C>T 


0.093 


506' 


65 


A > G 


0.031 


547 


67 


G > A 


0.463 


569 ' 


65 


G > A 


0.031 


589 


65 


G> A 


0.477 


622 


65 


G> A 


0.031 



The disclosed NOV8 polypeptide also has homology to the amino acid sequences 



shown in the BLASTP data listed in Table 8D. 



Table 8D. BLAST results for NOV8a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 13027596 | ref |NP 


hypothetical 
protein MGC24 87 
[Homo sapiens] 


204 


148/204 
(72%) 


148/204 
(72%) 


2e-63 


076421 .1 | 
(NM_023932) 


gi | 6 6 8273 9 | db j | BAA8 


S-l [Mus musculus] 


102 


63/118 
(53%) 


64/118 
(53%) 


7e-17 


8686.1) (AB011019) 


gil668H97|ref |NP 0 
31891 .1 | 
<NM_007865) 


delta-like 1 
(Drosophila) ; 
delta-like 1 
homolog 
(Drosophila) [Mus 
musculus] 


722 


38/66 
(57%) 


45/66 
(67%) 


3e-lS 


gi|l4091746 |ref |NP 


delta 
(Drosophila) -like 
1 [Rattus 
norvegicus] 


714 


37/66 
(56%) 


45/66 
(68%) 


4e-15 


114452 .1 | 
(NM_Q32063) 


gi|807696|qbjAAC380 
17.11 (L42229) 


x-Delta-l [xenopus 
laevis] 


721 


38/66 
(57%) 


44/66 
(66%) 


5e-15 



1 0 The homology between these and other sequences is shown graphically in the 

ClustalW analysis shown in Table 8E. 

Table 8E. ClustalW Analysis of NOV8a 

1 ) Novel NOV8 (SEQ ID NO: 1 8) 

2) gill30275961reflNP 076421.11 (NM_023932) hypothetical protein MGC2487 [Homo sapiens] (SEQ ID 
NO: 105) 

3) gi!66827391dbilBAA88686. 1 1 (AB01 1019) S-l [Mus musculus] (SEQ ID NO:106) 

4) g il668 1 1 97l rellNP 031891.11 (NM_007865) delta-like 1 (Drosophila); delta-like 1 homolog (Drosophila) 
[Mus musculus] (SEQ ID NO: 1 07) 

5) gill40917461reflNP 114452.11 (NM_032063) delta (Drosophila)-like 1 [Rattus norvegicus] (SEQ ID NO: 108) 

6) gil807696 | gblAAC38017.1l (L42229) x-Delta-l [Xenopus laevis] (SEQ ID NO: 109) 
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30 



NOV8 

gi| 13027596} 
gi|6682739 j 
gi(6681197j 
gi [l4091746| 
gi|807696| 



.|-....| 



l MGRRSALALAWSALLCQVWSSGVFELKLQEF'VNKKGLLGNRNCCRGG S G P P CACRTF FRVCLiKHYQ 

'mGRRSALALAWSALLCQWSSGVFELKLQEFVNKKGLLGNRNCCRGG S G PP CACRTF FRVCLKHYQ 

MGQQRMLTLLVIiSAVLCQISCSGLFELRLQEFVNKKGLLGNMNCCRPGSLASLQRCECKTFFRlCDKHYQ 



NOV8 

gi|l3027596 ] 
6682739 | 
6681197 j 
140917461 



gi 



gi|807696| 



80 



90 100 



110 



120 



130 



140 



ASVSPEPPCTYGSAVTPVLGVDSFSLPDGAGIDPAFSNPIRFPFGFTWPGTFSLI1EALHTDSPDDLATE 
ASVSPE.PPCTYGSAVTAVLGVDSFSLPDGAGIDPAFSNPIRFPFGFTWPGTFSLIIEALHTDSPDDLATE 
SNVSPEPPCTYGGAVTPVLGTNSFWPESSNADPTFSNPIRFPFGFTWPGTFSLIIEAIHADSADDLNTE 



NOV 8 

gi 1 13027596 | 
gi | 6682739| 
gi j 6681197 j 
gi|l4091746 | 
gi|807696 [ 



15(0 



160 1 

.|....|....|. 



170 



180 



190 



200 210 
..|....|....| 



NPERLISRLTTQRHLTVGEEWSQDLHSSGRTDLRYSYRFVCDEHYYGEGCSVFCRPRDDAFGHFTCGDRG 
NPERLISRLTTQRHLTVGEEWSQDLHSSGRTDLRYSYRFVCDEHYYGEGCSVFCRPRDDAFGHFTCGERG 
NPERLISRLATQRHIiTVGEQWSQDLHSSDRTELKYSYRFVCDEYYYGEGCSDYCRPRDDAFGHFSCGEKG 



NOV8 

gi|l3027596| 
gi|6682739 | 
gij 6681197 j 
gi|l4091746| 
gi|807696| 



220 23,0- 



240 



250 260 270 280 



EKMCDPGWKGQYCTD P I CLPGCDDQHGYCDKPGECKCRVGWQGRYCDEC IRYPGCLHGTCQQPWQCNCQE 
EKMCD PG WKG QYCTD P I CLPG CDDQHG YCDKPGECKCRVG WQGR YCD E C I RYPGCLHGTCQQ PWQCNCQE 
EKLCNPGWKGLYCTEPICLPGCDEHHGYCDKPGECKCRVGWQGRYCDECIRYPGCLHGTCQQPWQCNCQE 



NOV8 

13027596 | 
6682739) 
6681197) 
14091746 | 
807696 j 



gi 
gi 
gi 
gi 
gi 



290 



300 



310 



320 



330 



340 350 
..|....|....| 



GWGGLFCNQDLNYCTHH KP CRNG ATCTNTGQG S YTCS CR PG YTG ANCE LEVD ECAP S P CKNGAS CTDLED 
GWGGLFCNQDLNYCTHHKPCRNGATCTNTGQGSYTCSCRPGYTGANCELEVDECAPSPCRNGGSCTD1.ED 
GWGGLFCNQDLNYCTHHKPCENGATCTNTGQGSYTCSCRPGYTGSNCEIEVNECDANPCKMGGSCSDIjEN 



NOV8 

gi|l3027596| 
gi|6682739 j 
gi) 6681197 j 
gi|l4091746) 
gi|807696 | 



360 



370 



380 



390 



400 



410 



420 



SFSCTCPPGFYGKVCELSAMTCADGPCFNGGRCSDNPDGGYTCHCPLGFSGFNCEKKMDLCGSSlgSN 
SYSCTCPPGFYGKVCELSAMTCADGPCFNGGRCSDNPDGGYTCHCPAGFSGFNCEKKIDLCSSS^SN 
SYTCSCPPGFYGKNCELSAMTCADGPCFNGGRCADNPDGGYICFCPVGYSGFNCEKKIDYCSSN 



NOV8 


gi 


13027596 


gi 


6682739 | 


gi 


6681197 | 


gi 


14091746 


gi 


807696] 



NOV8 

gi)l3027596j 
gi (6682739) 
gi[6681197| 
gi)!4091746| 
gi|807696| 



430 
J 

TgLjgGINRj]SgLgp: 

tElSginr^sElEp] 



480 




ELVpP-VPDP 
jELV^jP-VPDP 



JSAPVSRCEHA 
iTgRNMSAPVSRCEHA 

uSSnHsmpxtkcehn 




530 

■|....|..-.|....| 

gTTgDTPLGPT^wgpATGPAPHS^AGLg'R PCSV. 

gTTgDTPLGPT^VvRpATGPAPHsSBAGLfflR jl'sV] 

MPT^jwBPATGPAPHsSgAGL JR jljsVKgtfy F 

gCHNGATCHQRGQRYMCECAQGYGGPNCQF *LPEPPPGPMvjvto;iJsgRHM^ 

JCHNGATCHQRGQRYMCECAQGYGGSNCQF jliPEPPP DLljvAAC^SFPWVAVCAGBvffiKliL 

ICHNGATCHERNNRYECQCARGYGGKfNCQFgl/PEEKP - - VV^^tJ^K YTEGgS^F PW^VCAGI V|^I*M 



560 

I LGEPSLffiATOVF 

^GEP^lJ^SByF 



NOV8 

gi|l3027596| 



570 



590 



600 



GA^Agi^LATRLg.T[jRAwERGVC| 

at3l-St| 



jTAgjL 



2rgv< 




gACpDQE.CQ 



610 

\h-- 



620 
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gi | 6682739 [ 
gi|6681197j 
gi j 14091746 j 
gij 807696 j 



N0V8 

gi j 13027596 | 
gij 6682739| 
gi[6681197| 
gi 1 14091746 | 
gij 807696 | 



NOV 8 

gi jl3027596 | 
gij 6682739| 
gi|6681197) 
gijl4091746 1 
gi | 807696 | 



GSg.TAgLRi^TffiLg.TfflRXwgRGI cgTgpCCYPA^yS^QDQECQggMIj — ' 

LL^CgASvycSRfl,K^KHQPPPEgC^ETETM^ 

IjL jjGCgAWWCBRjjKg^KHid P P PD§C^ET ETMNNLSNCQRE KP vs Ejj ? I^GATQI KNTNKKAt)PHGDHGA 
DlflGCaASviVcSRy ^RVQKRgHQPEACRGESKTM^I^NC^REiqDI sE^GTTQI KNTNKKIDPLSESNN 

640 650 660 670 ' 680 69*0 ' 700 



-PAGL; 

PAGL 

PAGF 

kkssfkvryptvd: 
dkssfkaryptvdVn 
ekngykprypsvd" 

710 




.|....|. 



.|....|. 



KGDEATVRLTHSKRDTKCQSQSSAGEEKIAPTLRGGEIPD] 
■KGDEATVRDAHSKRDTKCQSQGSVGEEKSTSTLRGGEVPDRKR 
NEDSPKEERSKCEAKCSSNDSDSED-VNSVHSKRDSSEJ 




7^20 



- 

jSKfflTKYQSVYVLSAEKDECVIATEV 
SKfflTKYQSVYVLSAEKDECVIATEV 

Is'kBtkyqsvyvisdekdeci iatev 



Table 8F lists the domain description fi'om DOMAIN analysis results against N0V8. 
This indicates that the N0V8 sequence has properties similar to those of other proteins known 
to contain these domains. i 



Table 8F Domain Analysis of NOV8 

gnl |Pfamjpfam00008 , EGF, EGF-l'ike domain. There is no clear separation 
between noise and signal. pfam00053 is very similar, but has 8 instead 
of 6 conserved cysteines . Includes some cytokine receptors . The EGF 
domain misses the N- terminus regions of the Ca2+ binding EGF domains 
(this is the main reason of discrepancy between swiss-prot domain 
start/end and Pfam) . The family is hard' to model due to many, similar 
but different sub- types of EGF domains. Pfam certainly misses a number 
of EGF domains. (SEQ ID NO: 110) 
Length = 33 residues, 84.8% aligned 

Score - 35 .0 bits (79) , Expect ~ o,oo4 



NOV8 : 



00000: 



PCANGATCLDGINRFSCLCPEGFAGRFC 3 0 

|| + || || ++ ++ | + || | + | + ] 

PCSNGGTCVNTPGGYTCICPPGYTGKRC 33 



The above defined information for NOV8 suggests that NOV8 may function as a 
member of a S-l protein family. Therefore, the NOV8 nucleic acids and proteins of the 
invention are useful in potential therapeutic applications implicated in various diseases and 

10 disorders described below and/or other pathologies. For example, the NOV8 compositions of 
the present invention will have efficacy for treatment of patients suffering from: 
adrenoleukodystrophy, congenital adrenal hyperplasia, Von Hippel-Lindau (VHL) syndrome, 
Alzheimer's disease, stroke, tuberous sclerosis, hypercalceimia, Parkinson's disease, 
Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, multiple sclerosis, 

15 ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, anxiety, pain, 

neuroprotection, renal artery stenosis, interstitial nephritis, glomerulonephritis, polycystic 

kidney disease, systemic lupus erythematosus, renal tubular acidosis, IgA nephropathy, 

cirrhosis, transplantation, systemic lupus erythematosus, autoimmune disease, asthma, 
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emphysema, scleroderma, allergy, ARDS, fertility, diabetes, pancreatitis, obesity, endocrine 
dysfunctions, growth and reproductive disorders and/or endometriosis. The NOV8 nucleic 
acid encoding S-l-like protein, and the S-l-like protein of the invention, or fragments thereof, 
may further be useful in diagnostic applications, wherein the presence or amount of the nucleic 
acid or the protein are to be assessed. 

NOV9 



A disclosed NOV9 nucleic acid of 3981 nucleotides (also referred to CG5 5997-02) 
encoding a novel Guanine Nucleotide Releasing Protein-like protein is shown in Table 9A. An 
open reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 
and ending with a TGA codon at nucleotides 3772-3774. A putative untranslated region 
downstream from the termination codon is underlined in Table 9A, and the start and stop 
codons are in bold letters. 



Table 9A. NOV9 Nucleotide Sequence (SEQ ID NO:19) 



ATGCAGAAAGCCATCCGACTTAATGATGGCCACGTCGCGCCCCTGGGACTGCTGGCGCGCAAGGACGGCACGCGCAAAGG 
CTACCTGAGCAAGCGGAGTTCGGACAACACAAAATGGCAAACCAAGTGGTTCGCGCTGCTGCAGAACCTGCTCTTCTACT 
TCGAGAGCGACTCGAGCTCGCGGCCCTCGGGGCTTTACCTGCTGGAGGGCTGCGTCTGCGACCGCGCGCCCTCCCCCAAG 
CCGGCGCTGTCGGCCAAGGAGCCGCTGGAGAAACAGCATTACTTCACGGTGAACTTCAGCCATGAGAACCAGAAAGCCTT 
GGAGCTGAGGACAGAGGACGCAAAAGATTGTGACGAATGGGTGGCAGCCATTGCACATGCCAGCTACAGGACCCTCGCCA 
CAGAG CATG AGGC ATT AATGC AG AAATACCTG CACCTGCTGCAG ATCGTGGAG AC AG AG AAG ACCGTGG CCAAG C AG CTT 
CGGCAGCAGATCGAGGATGGGGAGATCGAGATCGAGCGGCTGAAGGCAGAGATCACATCCCTGCTCAAGGACAATGAGCG 
CATGCAGTCCACCCAGACTGTCGCCCCCAACGATGAAGACAGCGACATCAAGAAAATTAAGAAGGTGCAGAGCTTCCTGC 
GGGGCTGGCTGTGCCGGCGGAAGTGGAAGACCATCATCCAGGACTACATCCGGTCACCCCATGCTGACAGCATGCGCAAG 
AGGAACCAGGTGGTGTTCAGCATGCTGGAGGCTGAGGCTGAGTACGTGCAGCAGCTGCACATCCTTGTCAACAATTTCCT 
GCGCCCGCTGCGGATGGCCGCCAGCTCCAAGAAGCCTCCCATCACACACGACGACGTCAGCAGCATCTTCCTGAACAGCG 
AAACCATCATGTTTTTACATCAGATCTTTTACCAAGGCCTGAAGGCCCGCATCTCCAGCTGGCCCACGCTGGTCCTGGCT 
GACCTATTTGACATCCTGCTGCCCATGCTCAACATCTACCAAGAGTTCGTCCGCAACCACCAGTACAGCCTGCAGATCCT 
GGCCCACTGCAAGCAGAACCGTGACTTCGACAAGCTGCTGAAGCACTACGAGGCCAAGCCTGACTGCGAGGAGAGGACGC 
TGGAGACCTTCCTCACCTACCCCATGTTCCAGATCCCCAGGTACATCCTGACCCTCCATGAGCTCCTGGCCCACACGCCT 
CATGAGCACGTTGAGCGCAACAGCCTGGACTACGCCAAGTCCAAACTGGAGGAGCTGTCCAGAATAATGCACGATGAAGT 
AAGTGAGACGGAGAACATCCGGAAAAACCTGGCCATCGAGCGCATGATCATCGAAGGCTGTGAGATCCTCCTGGACACCA 
GCCAGACCTTTGTGAGACAAGGTTCCCTCATTCAGGTGCCCATGTCTGAAAAGGGCAAGATCACCAGGGGGCGCCTGGGG 
TCTCTCTCCCTAAAGAAAGAGGGCGAGCGACAGTGCTTCCTGTTTTCTAAGCATCTGATTATCTGTACCAGAGGCTCTGG 
AGGGAAGCTTCACTTGACCAAGAATGGAGTCATATCCCTCATTGACTGCACTTTATTGGAGGAGCCAGAAAGCACGGAGG 
AGGAAGCCAAAGGATCCGGCCAAGACATAGATCACTTGGATTTTAAAATCGGGGTGGAGCCAAAGGATTCCCCGCCCTTT 
ACAGTCATCCTAGTGGCCTCGTCCAGACAGGAGAAGGCAGCGTGGACCAGTGACATCAGCCAGTGTGTGGATAACATCCG 
ATGCAATGGGCTCATGATGAACGCATTTGAAGAAAATTCCAAGGTCACTGTGCCGCAGATGATCAAGTCCGACGCCTCCT 
TATATTGTGATGATGTTGACATTCGCTTCAGCAAAACCATGAACTCCTGCAAAGTGCTGCAGATCCGCTACGCCAGTGTG 
GAGCGGCTGCTGGAGAGGCTGACGGACCTGCGCTTCCTGAGCATCGACTTCCTCAACACCTTCCTGCACTCCTACCGCGT 
CTTCACCACCGCCATCGTGGTCCTGGACAAGCTCATTACCATCTACAAGAAGCCTATCAGTGCCATTCCTGCCAGGTCGC 
TGGAGCTCCTGTTTGCCAGTGGCCAGAACAATAAGCTCCTGTACGGTGAACCCCCCAAGTCCCCGCGCGCCACCCGCAAG 
TTCTCCTCGCCGCCACCTCTGTCCATCACCAAGACATCGTCACCGAGCCGCCGGCGGAAGCTCTCCCTGAACATCCCCAT 
C ATCACTGGCG G C AAGG CCCTGG ACCTGGCCG CCCTCAG CTG C AACT CCAATGG CTAC AC CAGCATGTACTCGGCCATGT 
CACCCTTCAGCAAGGCCACGCTGGACACCAGCAAGCTCTATGTGTCCAGCAGCTTCACCAACAAGATTCCAGATGAGGGC 
GATACGACCCCTGAGAAGCCCGAAGACCCTTCAGCGCTCAGCAAGCAGAGCTCAGAAGTCTCCATGAGAGAGGAGTCAGA 
TATTGATCAAAACCAGAGTGATGATGGTGATACTGAAACATCACCAACTAAATCTCCAACAACACCCAAATCAGTCAAAA 
ACAAAAATTCTTCAGAGTTCCCACTCTTTTCCTATAACAATGGAGTCGTCATGACCTCCTGTCGTGAACTGGACAATAAC 
CGCAGTGCCTTGTCGG CCGCCTC TG C CTTTG C C AT AGCAACCGC CGG GGCCAACG AGGG CAC CCCAAAC AAGGAG AAGTA 
CCGGAGGATGTCCTTAGCCAGTGCAGGGTTTCCCCCAGACCAGAGGAATGGAGACAAGGAGTTTGTGATCCGCAGAGCAG 
CCACCAATCGTGTCTTGAACGTGCTCCGCCACTGGGTGTCCAAGCACTCTCAGGACTTTGAGACCAACGATGAGCTCAAA 
TGCAAGGTGATCGGCTTCCTGGAAGAAGTCATGCACGACCCGGAGCTCCTGACCCAGGAGCGGAAGGCTGCAGCCAACAT 
CATCAGGACTCTGACCCAGGAGGACCCAGGTGACAACCAGATCACGCTGGAGGAGATCACGCAGATGGCTGAAGGCGTGA 
AGGCTGAGCCCTTTGAAAACCACTCAGCCCTGGAGATCGCGGAGCAGCTGACCCTGCTAGATCACCTCGTCTTCAAGAAG 
ATTCCTTATGAGGAGTTCTTCGGACAAGGATGGATGAAACTGGAAAAGAATGAAAGGACCCCTTATATCATGAAAACCAC 
TAAGCACTTCAATGACATCAGTAACTTGATTGCTTCAGAAATCATCCGCAATGAGGACATCAACGCCAGGGTGAGCGCCA 
TCGAGAAGTGGGTGGCCGTAGCTGACATATGCCGCTGCCTCCACAACTACAATGCCGTACTGGAGATCACCTCGTCCATG 
AACCGCAGTGCAATCTTCCGGCTCAAAAAGACGTGGCTCAAAGTCTCTAAGCAGACTAAAGCTTTGATTGATAAGCTCCA 
AAAGCTTGTGTCATCTGAGGGCAGATTTAAGAATCTCAGAGAAGCTTTGAAAAATTGTGACCCACCCTGTGTCCCTTACC 
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15 



20 



TGGGGATGTACCTCACCGACCTGGCCTTCATCGAGGAGGGGACGCCCAATTACACGGAAGACGGCCTGGTCAACTTCTCC, 
AAGATGAGGATGATATCCCATATTATCCGAGAGATTCGCCAGTTTCAACAAACTGCCTACAAAATAGAGCACCAAGCAAA 
GGTAACGCAATATTTACTGGACCAATC^TTTGTAATGGATGAAGAAAG 

AACTCCCCACCTGA AGCTGTGCCCAGCCCAGACCCAGCTGCTCCCGGGGACATGTGCTAGATGATACTG'TACATATTCGT ' 
TTGGTTTCACTGGATTTTCXTCTTCAGTATGTGCTTCTCCAAGAATACAAATCGTCCTTGTTCTTAGATyCCTGTAGAAC 
CGGAATATGAATTTCTGCACCGTTTCAGACTTCGCCCACCCATCCCTCCCCTCGCCCGAAT 



The disclosed NOV9 nucleic acid sequence, maps to chromosome 15q24, has 2642 of 
2650 bases (99%) identical to the unidentified mRNA of Sequence 1 from Patent W09321314 

(gb:GEr>iBANK-ID:A75965|acc:A75965.1) (E - 0.0),. 

it * i 

A disclosed NOV9 polypeptide (SEQ ID NO:20) encoded by SEQ ID NO:19is 12<57 
amino acid residues and is presented using the one-letter amino acid code in Table 9B. Signal 
P, Psort and/or Hydropathy results' predict that NOV9 does not contain a signal peptide and is 
likely to be localized to the plasma membrane with a certainty of 0.5500 and in the 
endoplasmic reticulum (membrane) with a certainty of 0.5000. 



Table 9B. Encoded NOV9 protein sequence (SEQ ID NO:20). 



MQKAIRLNDGHVAPLGbliARKDGTRKGYLSKRSSDNTKWQTKWFALLQNLLFYFESDSSS^PSGLYLLEGCVCDRAPSPK 
PALSAKEPLEKQHYFTVNFSHENQKALELRTEDAKDCDEWAAIAHASYRT 

RQQIEDGEIEIERLKAEITSLLKDNERIQSTQTVAPNDEDSDIKKIKKVQSFLRGWLCRRKWKTIIQDYIRSPHADSMRK 
RNQWFSMLEAEAEYVQQLHILVNNFLRPbRMAASSKKPPITHDDVSSIFIjNSETIMFLHQIFYQGLKARISSWPTLVIiA 
DLFDILLP^3LNIYQEFVRNHQYSL0IIiAHCKQNRDPDKLLKHYEAKPDCEERTLETFLTYPMFQIPRYIliTLHEIiLAHTP 
HEHVERNSLDYAKSKbEELSRIMHDEVSETENIRKNLAIERMIIEGCEILIiDTSQTFVRQGSLIQVPMSEKGKITRGRLG 
SLSLKKEGERQCFIjFSKHIillCTRGSGGKLHLTKNGVISLIDCTLLEEPESTEEEAKGSGQDIDHLDFKIGVEPKDSPPF 
TVIIjVASSRQEKAAWTSDISQC^NIRCNGLMMNAFEENSKVTVPQMIKSDASLYCDDVDIRFSKTMNSCKVLQIRYASV 
ERLIjERLTDLRFLSIDFLNTFLHSYRVFTTAIVVLDKLITIYKKPISAIPARSIjELIjFASGQNNKLLYGEPPKSPRATRK 
FSSPPPLSITKTSSPSRRRKLSLNIPIITGGKALDLAALSCNSNGYTSMYSAMSPFSKATLDTSKLYVSSSFTNKIPDEG 
DTTPEKPEDPSALSKQSSEVSMREESDlDQNQSDDGDTETSPTKSPTTPKSVKNIOvrsSEFPLFSYNNGWMTSCRELDNK 
RSALSAASAFAIATAGANEGTPNKEKYRRMSIASAGFPPDQRNGDKEFVIRRAATNRVLNVLRHWVSKHSQDFETNDELK 
CKVIGFLEEV^DPELLTQERKAAANIIRTLTQEDPGDNQITLEEITQMAEGVKAEPFENHSAIjEIAEQLTLIiDHLVFKK 
IPYEEFFGQGWMKLEKNERTPYIMKTTKHFNDISNLIASEIIRNEDINARVSAIEKWVAVADICRCLHNYNAVLEITSSM 
NRSAIFRLKKTWLKVSKQTKALIDKLQKLVSSEGRFKNLREALKNCDPPCVPYLGMYLTDIiAFIEEGTPNYTEDGLVNFS 
KMRMISHIIREIRQFQQTAYKIEHQAKVTQYItLPQSFVMDEESLYESSLRIEPKLPT 



TheNOV9 amino acid sequence has 807 of 813 amino acid residues (99%) identical 
to, and 809 of 81 3 amino acid residues (99%) similar to, the 814 amino acid residue 
unidentified protein of Sequence 1 from Patent W09321314 (ptnr:REMTREMBL- 
ACC:CAB58578) (E - 0.0). 

NOV9 is expressed in at least the following tissues: adrenal gland, bone marrow, brain 
- amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - thalamus, 
brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, lymphoma - Raji, 
mammary gland, pancreas, pituitary gland, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, and uterus. This 
information was derived by determining the tissue sources of the sequences that were included 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

NOV9 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 9C. 
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Table 9C. BLAST results for NOV9 


Gene Index/ 
I dent i f ieir 


Protein/ Organism 


Length 
. (aa) 


Identity 
• (%) 


Positives 
(•%) 


Expect 


72 | GNRP HUMAN 


guanine 
nucleot ide 
releasing protein 
(GNRP) (RAS- 

specif ic 
nucleotide 
exchange factor 1 
CDC25) [Homo 
sapiens] 


1275 


1208/127 
5 (94%) 


' 1212/1275 
(94%) 


0.0 


gii228955jprf | [1814 
463A 

i 


guanine 
nucleotide- 

[Rattus 
norvegicus] 


1244 


1032/125 
8 (82%) 


1118/1258 
(88%) 


0.0 


gi | 25222 08 |gb}AAB80 


Ras-GRF2 ' [Homo 
sapiens] 


1237 


816/1266 
(64%) 


974/1266 
(76%) 


0.0 


953. 1| (AF023130) 


gi | 6755288 | ref |NP 0 


RAS protein- 

o^Jtrt LiJ. u yudiiJLiJc- 

nucleotide- 
releasing factor 

1; RAS guanyl 
releasing protein 
3 ; GRF beta; Ras 
guanine release 
factor beta [Mus 
musculus] 


1262 


1035/126 
5 (81%) 

t 


1116/1265 


0.0 


3 53 75 . 1 } 
(NM_011245) 


qijll360360|pir| | T4 


guanine 
nucleotide 
release/exchange 
factor Ras-GRF2 - 
mouse [Mus 
musculus] 


1189 


554/834 
(66%) 


654/834 
(77%) 


0.0 


2726 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 9D. 



Table 9D Information for the ClustalW proteins 

1) NOV9 (SEQ ID NO;20) 

2) gili3124259lsplQ139721GNRP HUMAN guanine nucleotide releasing protein (GNRP) (RAS-specifk 
nucleotide exchange factor CDC25) [Homo sapiens] (SEQ ED NO:l 1 1) 

3) gif228955|prf][l 8 1 44 63 A guanine nucleotide-releasing factor [Rattus norvegicus] (SEQ ID NO: 1 12) 

4) gil2522208lgbl AAB809S3.1| (AF02313O) Ras-GRJF2 [Homo sapiens] (SEQ ID NO: 11 3) 

5) gii6755288iref]NP 035375.11 (NM_01 1 245) RAS protein-specific guanine nucleotide-releasing factor 1 ; RAS 
guanyl releasing protein 3; GRF beta; Ras guanine release factor beta [Mus musculus] (SEQ ID NO: 1 14) 

6) gill 1360360lpir[lT42726 guanine nucleotide release/exchange factor Ras-GRF2 - mouse [Mus musculus] 
(SEQ ID NO: 115) 



NOV9 



91 
9i 

gi 
gi 
gi 



13124259] 
228955 j 
2522208 | 
6755288) 
113603601 



N0V9 

gi|l3124259i 
gi|228955| 
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gi|2522208 | 
gij 6755288 | 
gi 1 11360360 j 



N0V9 

gi| 13124259 | 
gij 228955 | 
gi|2522208 j 
gi | 6755288 | 
gij 11360360 | 



N0V9 

gi| 13124259 | 
gij 228955 | 
gi | 2522208 | 
gi | 6755288 j 
,gi| 11360360 | 



NOV 9 

gi|l3124259 | 
gij 228955 | 



gi 

gi 
gi 



2522208 j 
6755288 j 
11360360] 



N0V9 



gi 
gi 



13124259 | 
228955 j 
gi 2522208j 
gi 6755288| 
gi 11360360| 



N0V9 

gi}l3124259 | 
gi|228955 | 
gi j 2522208 | 
gij 6755288 j 
gij-11360360 | 



NOV? 

gi| 13124259 j 
gi|228955| 
gi|2522208 j 
gij 6755288 [ 
gij 11360360| 



N0V9 

gi 1 13124259 j 
gij228955| 
gi J2522208 | 
gi|6755288 j 
gijll360360| 



N0V9 

gi|l3124259| 
gij228955| 
gi|2522208 | 
gij 6755288 | 
gi|H360360l 



gsgEgTgdgpjgAGAGQGGVRDAjjD 

SilgKgAjS^^GTgsSSsG EI< £E 

EsgESTgAQPETN^GPAGARDASD^ 



KQyYFTVLF 




•I. 



I 



230 



250 
|..,..J. 



260 



270 



KKIKKVQS FLRGWLCRRKWKTI IQD YIRSPHADS 
KKX KKVQ S FLRGWLCRRKWKTI I QDYI RSPHADS 
KKIKKVQSPLRGWLCRRKWKjJJriQDyiRSPHADSM 
KKIKKVQSF^G^LCRRKWKTrEQDYlgSPHAjlSMRKR] 

kkikkvqsflrgwxicrrkwi^ll iqdyirsphadsmrkm 
kkikkvqsfErgw^rrkwktiBqdyiHsphrSsmiucri 



wfsmleaeaeyvqqlhil 
qwfsmleaeaeyvqqlhiiiv 
qwfsmleaeaeyvqqlhi lv 
[qQvfSmQbas^eyvQqlSxlv 
qvvfsmleaeaeyvqqlhilv 
qSvfHwSeaeHsyvB]qi|Silv 



280 

153 

m 



giLVSGFLRP 



420 




GERQCFIiFSKHIillCTRGSGGKliHLTKgG 
L^KEGERQ CFLFS KHLI I CTRGS GGKLHLTkSg 
LKKEGERQCFLFSKHLI ICTRGSG^KLHLTK^G 
LKKEGERQCFLFgKHlHlCTRlSGGKIjHlflKTG 
KKEGERQCFLFSKHLirCTRGSGGKItHLTKjjG 

lkkegerqcflfSkhBSictrSsggklhiQictg 




740 



750 



770 
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NOV9 

gi | 13124259 | 
gij 228955 | 
gi (2522208 | 
gij 6755288) 
gi|ll360360| 



NOV9 

gi|l3124259| 
gi[228955 | 
gi|2522208| 
gi|6755288i 
gi|ll360360| 



N0V9 

gi|l3124259| 
gij 228955J 
gi[2522208j , 
gi|6755288| 
gijll360360| 



N0V9 

gijl3124259] 
gi (228955 | 
gi|2522208) 
gij 6755288 j 
gijll360360| 



N0V9 

gi|l3124259 | 
gi|228955 j 
gi|2522208| 
gi|6755288| 
gi|ll360360| 



N0V9 

gi |13124259| 

gij228955| 

gi|2522208| 

gi]6755288| 

gi|ll360360| 



N0V9 

gi jl3124259] 

gi(228955| 

gi|2522208j 

gi|6755288| 

gi|ll360360| 



N0V9 

gi|l3124259| 
gi | 228955 | 
gi|2522208| 
gi |6755288| 
gi|ll360360| 




1060 1070 




1080 
..]... 



1110 



S I AEQLTLIjDHIiVFK Jipyeeffcqg* 
siaeqltlldhlvfkSipyeeffgqgi* 



kSertpyimktt mm 

SIAEQLTLLDHLVFKS I PYEEFFGQGWMKj3gKSERTPYIMKTT gHFjg 
E3AE€^jTLIiDHEp|siPYEEF[lGQGWMI^ kSeRTPYIMICI^ mm 
ElAEQLTLLDHLVFKSIPYEEFFGQGWMKgg KYERTPYIMKTT mm 
lEffAEOffTLIJHnVF^SIPYEEFgGQGWMiaS kSeRTPYIMKTE ShfS 



1130 



SsSliasei 
SsSliasei 

JsBjlASEIj 





1200 1210 
J...-I I . 



CB PPCWP YLGMYLTDLAFIEEGTPEjYT^ I RE I RQ FQQTSYKI 

LRESL^OJPPCVPYtiGI^LTDLAFIEEGTpSyTEDGLvSFSKMRMISHIIREl 
LRE«LgslCU3PPCVPYI^MYLTO 
LRE^^aJppjJvPYIKStmjTDIiRF^ 
LRlSI«CDPPCOTYLG^rn^TDI^ 

LREML]^CnRpp9\^YLGMYliTDl^F IEEGTpSSTEj^ItVSFS KMRMI SHI I RE IRQFQQT*^ 
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gi|6755288| 
gij 11360360] 

Tables 9E - 9 J list the domain description from DOMAIN analysis results against 
NOV9. This indicates that the NOV9 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 9E. Domain Analysis of NOV9 

gnl | Smart | smart00l47 , RasGEF, Guanine nucleotide exchange factor for 
Ras-like small GTPases (iSEQ ID n6:116) 
Length = 242 residues, 99.6% aligned 

Score = 234' bits (59B) , Expect « 2e-62 



NOV9: 


1018 


00147: 


1 1 


NOV9 : 


1077 


00147: 


61 


NOV9: 


1137 


00147 : 


121 


NOV9: 


1196 


00147 : 


181 


NOV9: 


1254 


0 014 7 : 


241 



FENHSAL^lAEOLTLLDHLVFKKIPYEEFFGQGWMKLEKNERTpy - IMKTTKHFNDISNL 

Mimui +1+11 iiiri +i + + + U++U 

IiIiLLDPKELAEQLTLLDFELFRKIDPSELLGSVWGKRSKKSPSPLNLERFIERFNEVSNW 



KQTKALIDKLQKLVeSEGRFKNLREALKNC-DPPCVPYLGMYLTDLAFIEEGTPNYTEDG 

+ I I ++I++I+ 111 1 1 1 1 +1 III + I + H+ I 'II ll + ll i++ ++I 

SKYKKLFEELEELIiDPSRNFKNYREALSSCNLPPCIPFLGVLLKDLTFIDEGNPDFLKNG 



1076 



60 



IASEIIRNEDINARVSAIEKWVAVADICRCLHNYNAVLEITSSMNRSAIFRLKKTWLKVS 1136 
+ | + || ++ | + | ++ || |1 | + | + | + + + | |++ + '| | I | | | | | | + 

VATEILKQTTPKDRAELLSKFIQVAKHCRELNNFNSLMAIVSALSSSPISRLKKTWEKLP 120 



1195 



180 



1253 



INI I I 1+ Mill! I I + I II II + +! Ml Mllll 

IjVNFEKRRKIAKILREIRQLQSQPYNLRPNRSDIQSLLQQSLDSLPEENELYELiSLRIEP 240 
K 1254 
R 241 



Table 9F. Domain Analysis of NOV9 

gnl | Pf am | p£am00617 , RasGEF, RasGEF domain. Guanine nucleotide exchange 
factor for Ras-like small GTPases. (SEQ ID NO: 1X7) 
Length - 188 residues, 99.5% aligned 

Score = 224 bits (570), Exp ect 3e-59 



NOV9: 


1020 


00617: 


2 


NOV9: 


1079 


00617 : 


62 


NOV9: 


1139 


00617: 


122 


NOV9: 


1198 


00617: 


182 



ll+l+lllll+l + 1111 I II I I llll+l I II l + ll ++I + 

LliDPLELAKOLTLLEHELFKKIDPFECLGOVWGKKyGKNERSPNIDKTIKNFNQLTNPVG 61 



+ 1+1 I I + I+ + III II I+I+I++I I I++ I l+llllll I I 
TTILLQTDPKKRAELIQKFIQVADHCRELNNFNSLLAIISALySSPIYRLKKTWQyVPPQ 121 

TKAL1DKLQKLVSSEGRFKNLREALKNCDP- PCVPYIjGMYLTDIjAFIEEGTPNYTEDGLV 1 197 

+ I ++I 11+ 1+ I I II 11+ I 1111+ |+||*|| l+lll I++ I II 
SLKLFEELNKLMDSDRNFSNYRELLKSIFPLPCVPFFGVYLSDLTFLEEGNPDFLETNLV 181 
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Table 9G. Domain Analysis of NOV9 

gnl | Pf am [pf am00621 , RhoGEF, RhoGEF domain. Guanine nucleotide exchange 

factor for Rho/Rac/Cdc4 2~like GTPases Also called Dbl -homologous (DH) 

domain. It appears that pfamo6l69 domains' 'invariably occur O terminal 

to RhoGEF/DH domains. (SEQ ID NO: 11 8) 

Length = 182 residues, 100.0% aligned 

Score * 192 bits (487) , Expect = le-49 



NOV9 : 


244 


W F S ML E AE AE Y VQQLH I LVNN F L, RP LRMAAS S KKP P I THDD VS S I F LNSET I M FLHQ I F 


3 03 






1+ + 111 + 11 + Mill +++MI II II 1 +1 1 1+ +11 1 1 1+ 1 




00621: 


1 


VLKELLETEKKYVRDLE I LVNVYMKPLREAAI SK- PVLTPDpLETI FSNINEI YEFHREF 


59 


NOV9 : 


304 


YQGLKARI SSWP - TLVLADLFDI LLPMLNI YQEFVRNHQYSLQI LAHCKQNRDFDKLLKH 


362 






+ 1+ mm i i iii 1 1 i ii i+ i i+ ++i i i i + r 




00621: 


60 


LKSLEDRISSSPSAPRLGDLFLKLEPFLQIYGEYCANKPYAQELLEKCSSNPQFAEFLDE 


119 


NOy9 : 


363 


YEAKPDCEERTLETFLXYPMFQIPRYILTLHELLAHTPHEHVERNSLDYAKSKLEELSRI 


422 






ii + + n++ i i+ +mim i iimii + +i i r 1++1++ 




00621 : 


120 


VEASSNTGKLTLQSLLLKPVQRIPRYPLLLKELLKHTPEDQPDREDLKKALDLLQDLAKS 


179 


N0V9: 


423 


MHD 4 25 








+++ 




00621: 


180 


INE 182 





Table 9H. Domain Analysis of NOV9 

gnl ] Smart | smart 0 03 2 5 , RhoGEF, Guanine nucleotide , exchange factor for 
Rho/Rac/cdc42-like GTPases; Guanine nucleotide exchange factor for 
Rho/Rac/Cdc42-like GTPases Also called Dbl -homologous (DH) domain. It 
appears that PH domains invariably occur C- terminal to RhoGEF /DH 
domains. Improved coverage. (SEQ ID NO: 119) 
Length = 181 residues, 100.0% aligned 

Score = 139 bits (349) , Expect = le-33 



NOV9 : 


244 


WFSMLEAEAEYVQQLHILVNNFLRPLRMAASSKKPPITHDDVSSIFLNSETIMFLHQIF 


303 






1+ +1+ 1 11 + 1 III 11 + 11+ 1 ++ l + l ++I 1 II Ml 




00325 : 


1 


VLKELLQTERN YVRDLKI LVEVFLKPLKKEAK LLSPDEVETLFGNIEEIYEFHRIF 


56 


NOV9 : 


304 


YQGLKARI SSWPTLV LADLFDILLPMLNIYQEFVRNHQYSLQILAHCK-QNRDFDKLL 


360 






1+ 1+ 1 + 1+1 1 + II 1+ II + I++I 1 +1+ 1 II 




00325: 


57 


LDELEKRVEEWDDSGDRIGDVFLKLEELFKIYSEYCSNHPDALELLKKLKKKNKRFQKFL 


116 


NOV9 : 


361 


KHYEAKPDCEERTLETFLTYPMFQIPRYILTLHELLAHTPHEHVERNSLDYAKSKLEELS 


420 






1 1+ l + l 11+ 1 1+ ++ +1 1 1 III III +| +| 1 1 ++II+ 




00325: 


117 


KEI ESNPNCRRLELESLLLKPVQRLTKYPLLLKELLKHTPPDHEDREDLKKALDAI KELA 


176 


NOV9: 


421 


RIMHD 425 








++ + 




00325: 


177 


SQVNE 181 





Table 91. Domain Analysis of NOV9 

gnl | Pfamjpf am00618 , RasGEFN, Guanine nucleotide exchange factor for 
Ras-like GTPases; N- terminal motif. A subset of guanine nucleotide 
exchange factor for Ras-like small GTPases appear to possess this 
motif /domain N- terminal to the RasGef (Cdc25-like) domain. (SEQ ID 
NO:120) 

Length = 56 residues, 96.4% aligned 

Score = 62.8 bits (151), Expect = le-10 



NOV9 : 629 SCKVLQI RYASVERLLERLTDLRFLSI DFLNTFLHS YRVFTTAI WLDKLI TI Y 682 

II l + ++I l+l MM + 1+ Ml +11 II +M ||+ I 
00618 : 1 YDKVGSIKGGTLEALIEYLTDLESEDLFFVETFLLTYRSFITTQELLDLLISRY 54 
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Table 9J. Domain Analysis of NOV9 

gnl [Smart 1 smart00233 / PH, Plecjcstrin homo^gy . domain. ; Domain commonly 
found in eukaryotic signalling proteins. The domain family possesses 
multiple functions including the abilities to bind inositol 
phosphates, and various proteins. PH domains have 'been found to 
possess inserted domains (such as in PLC gamma, syntrophins) and to be 
inserted within other domains. Mutations in Brutons tyrosine kinase 
(Btk) within its PH domain cause X-linked agammaglobulinemia (XLA) in 
patients. Point mutations cluster into, the positively charged end of 
the molecule around the predicted binding site for 
phosphatidyl inositol lipids. (SEQ ID NO: 121) 1 
Length = 104 residues, 98.1% aligned 

Score = 58.9 bits (141), Expect = 2e-09 



NOV9: 24 TRKGYLSKRSSDNTK- WQTKWFALLQNLLFYFESD- - - SSSRPSGLYLliEGCVCDRAPSP 79 , 

++I+I Ml I I* ++I I +1 lll+l I I II II 

00233 : 2 IKEGWLLKKSSGGKKSWKKRYFVLFNGVLLYYKSKKKKSSSKPKGSIPLSGCTVREAPDS 61 

NOV9: 80 KPALSAKEPLEKQHYFTVNFSHENQKALELRTEDAKDCDEWVAAIAHASYR 130 

' ' + + + I + ++I I 1+ I ++ II I 1+ I + 

00233: 62 DSDKKKNCFEI - - VTPDRKTLLLQAESEEERKEWVEALRKAIAK 103 

5 

The stimulation of a variety of cell surface receptors promotes the accumulation of the 
active, GTP-bound form of Ras proteins in cells, which is a critical step in signal transduction. 
To reach the active GTP-bound state, Ras proteins must first release bound GDP. This rate- 
limiting step in GTP binding is thought to be catalyzed by a guanine-nuclcotide-relcasing 

1 0 factors (GRF) (Overbeck et. aL, Mol. Reprod. Dev., 42:468-476, 1995). The exchange factor 
Ras-GRFl, also called CDC25Mm, couples calcium signaling and G-protein-coupled 
receptors to Ras and downstream effectors. Ras-GRFl has also been shown to strongly 
enhance the level of active Ras (Ras-GTP) and the activity of mitogen-activated protein 
kinases (MAPK) in different cell lines (Zippel et ah, Exp. Cell Res. 258:403-408, 2000). The 

15 present invention is a novel variant of previously described Ras-GRFl (Schweighoffer and 
Tocque,1994, GenSeq: AC R43578) that antagonises the interaction of GDP-exchange 
factor(GRFs) with the p21-GDP complex and thereby regulates the activity of ras gene 
products. 

The protein similarity information, expression pattern, and map location for the NOV9 
20 suggest that NOV9 may have important structural and/or physiological functions characteristic 
of the Guanine Nucleotide Releasing protein family. Therefore, the NOV9 nucleic acids and 
proteins of the invention are useful in potential therapeutic applications implicated in various 
diseases and disorders described below and/or other pathologies. For example, the NOV9 
compositions of the present invention will have efficacy for treatment of patients suffering 
25 from myocardial hypertrophy, high-grade gliomas, cancers such as breast cancer, cervical 
cancer, lung cancer, pancreatic cancer and/or prostate cancer. The NOV9 nucleic acid 
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encoding Guanine Nucleotide Releasing Protein-like protein, and the Guanine Nucleotide 
Releasing Protein-like protein of trie invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein' the presence or amount of the nucleic acid 'or the protein are 
to be assessed. . . 

NOV10 

A disclosed NOV10 nucleic acid of 748 nucleotides (designated CuraGen Acc. No. 
133268995_dal) encoding a novel Interleukin-l-like protein is shown in Table 10A. An dpen 
reading frame was identified beginning with an ATG initiation codon at nucleotides 1-3 and 
ending with a TAG codon at nucleotides 412-41 A. A putative untranslated region downstream 
from the termination codon is underlined in Table 10A, and the start and stop codons are in 
bold letters. 



Table 10A. NOV10 Nucleotide Sequence (SEQ ID NO:21) 



ATGGTCCTGAGTGGGGCGCTGTGCTTCAGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGCATAATAACC^GCTTC 

tagctggagggctgcatgcagggaaggtctccctcctagagaagatctgcatacttcctaacagaggcttggcccgcac 
caaggtccccattttcctggggatccagggagggagccgctgcctggcatgtgtggagacagaagaggggccttcccta 
cagctggagcagccagtgaacatcatggagctctatcttggtgccaaggaatccaagagcttcaccttctaccggcggg 
acatggggctcacctccagcttcgagtcggctgcctacccgggctggttcctgtgcacggtgcctgaagccgatcagcc 

TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCCAGCAGTGTGACTAG 
GGCAAC 



15 



20 



25 



The nucleic acid sequence of NOV 10 maps to chromosome 2 and has 426 of 480 bases 
(88%) identical to zHomo sapiens interleukin-1 receptor antagonist homolog (IL1HY1) 
mRNA (gb:GENBANK-ID:AF186094|acc: API 86094.1) (E= 3.9e 77 ). 

A NOV10 polypeptide (SEQ ID NO:22) encoded by SEQ ID NO:21 is 157 amino acid 
residues and is presented using the one. letter code in Table 10B. Signal P, Psort and/or 
Hydropathy results predict that NOVl 0 is likely to be localized extracellularly. 



Table 10B. NOV10 protein sequence (SEQ ID NO:22) 



^LSGALCFRMKDSALKVLYLHNNQLI^GGLHAGI^ 

PVNIMELYLGAKBSKSFTFYRRPMGLTSSFESAAYPGWFLCTVPEADQPVRLTQIiPENGGWNAPITDFYFQQCD 



The NOV10 amino acid sequence has 131 of 157 amino acid residues (83%) identical 
to, and 141 of 157 amino acid residues (89%) similar to, a Homo sapiens 155 amino acid 
residue FIL1 delta (interleukin-1 like protein 1) (interleukin-1 receptor antagonist homolog 1) 
(interleukin-1 delta) (ptnr:SPTREMBL-ACC:Q9UBH0) (E= 1.3e 66 ). 

NOV 10 is expressed in at least the following tissues: placenta. This information was 
derived by determining the tissue sources of the sequences that were included in the invention 
including but not limited to SeqCalling sources, Public EST sources, Genomic Clone sources, 
Literature sources, and/or RACE sources. In addition, NOV 10 is predicted to be expressed in 
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placenta tissues because of the expression pattern of a closely related Homo sapiens 
interleukin-1 receptor antagonist homolog (IL1HY1) mRNA (gb:GENBANK~ 
3D:AF186094|acc:AF186094.1). ' • 

NOV10 also has homology to the amino acid sequences shown in the BLASTP data 
5 listed in Table 10C. 



Table 1 0C. BLAST re$u Its for NO VI 0 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) * 


Expect 


gi | 6912432 jrefjNP 0 


similar to 
VITELLINE MEMBRANE 
OUTER LAYER 
PROTFTN T 

JT J\\J _L Hi JL IV J. 

PRECURSOR (VMO-I) 
(VMOI) (H . 
sapiens) [Homo 
sapiens] 


155 


131/157 
(83%) 


141/157 
(89%) 


3e-70 

i 


36407.1! 
(NM_012275) 

J-ULCi. ACU/VIU JL. 

receptor antagonist 
homolog 1 [Homo 
sapiens] 


qi j 9506807 J ref |NP 0 


VITELLINE MEMBRANE 
OUTER LAYER 
PROTEIN I 
PRECURSOR (VMO-I) 
(VMOI) [Gallus 
gallus) 


155 


126 /157 
(80%) 


13 9/157 
(88%) 


6e~ 68 


62324. 1| 
(NM_019451) 
interleukin 1 
family, member 5 
(delta) ; 
interleukin 1 
receptor antagonist 
homolog 1 [Mus 
musculus] 


gi | 9651791 (gb|AAF91 
275.l|AF230378 1 
(AF230378) 
interleukin-1 delta 
[Mus musculus] 


Chain A, Vitelline 
Membrane Outer 
Layer Protein I 
[Gallus gallus] 


156 


126/157 
(80%) 


139/157 
(88%) 


6e-68 


gi | 18025344 | gb J AAK3 
3010. 1| (AY029413) 
interleukin-1 
receptor 
antagonist -like 
FIL1 theta [Homo 
sapiens] 


fertilization 
envelope outer 
layer protein 
[Cyprinus carpio] 


152 


84/139 
(60%) 


95/139 
(67%) 


9e-36 


gi | 14573319 jgb|AAK6 
8048 .1 [AF334755 1 

(AF334755) 
interleukin-1 HY2 
[Homo sapiens] 


Y9C9A. l.p 
[Caenorhabditis 
elegans] 


152 


82/139 
(58%) 


93/139 
(65%) 


le-34 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 10D. 

Table 10D ClustalW Analysis of NOV10 

1) NOV10(SEQJDNO:22) 

2) gil69l24321reflNP_0 36407.1[ (NM_012275) interleukin-1 receptor antagonist homolog 1 [Homo sapiens] 
(SEQ ID NO: 122) 

3) gil9506807lrefTNP 062324.11 (NM_019451) interleukin 1 family, member 5 (delta); interleukin 1 receptor 
antagonist homolog 1 [Mus musculus] (SEQ ID NO: 123) 

4) gi|965 1 791 j gb|AAF9 1 275 . 1 1AF2303 78_1 (AF230378) interleuknvl delta [Mus musculus] (SEQ ID NO:124) 

5) gil 1 8025344|gbtAAK33Q 10,1 1 (AY029413) interleukin- 1 receptor antagonist-like FIL1 theta [Homo sapiens] 
(SEQ ID NO: 125) 
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6) gi )14573319. | gblAAK68048JlAF334755 1 (AF334755) interleukin-1 HY2 [Homio sapiens] (SEQ ID 
NO:126) t ■ l( 



NOV10 

gi|6912432| 
gi|9506807j 
gij 9651791} 
gi|l8025344 | 
gi|l4573319| 



NOV10 

gij6912432[ 
gij 9506807 | 
gij965179l| 
gi|l8025344 | 
gi|l4573319j 



gi 
gi 
gi 
gi 



6912432 | 
9506807j 
9651791 j 
18025344 | 
14573319 j 




Tables 10E and 10F list the domain description from DOMAIN analysis results against 
NOV 10. This indicates that the NOV10 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 10E. Domain Analysis of NO VI 0 

gnl j Smart | smart0012S , IL1, Interleukin-1 homologues ; Cytokines with 
various biological functions. Interluekin 1 alpha and beta are also 
known as hematopoietin and catabolin. (SEQ ID NO: 127) 
Length - 148 residues, 96.6% aligned 

Score = 123 bits (309), Expect = 7e-30 , 



NOV10 : 


9 


00125 : 


6 


NOV1 0 : 


67 


00125 : 


66 


NOV10: 


127 


00125: 


125 



FRMKDSMjKVLYLHN- NQLIiAGGLHAGKVSLLEKI CILPNRGLART- KVPI FLGIQGGSR 6 6 

I 1+ 1+ MM III + + I + +1 1+1+ 111+ + 

FRLNDANQKSLVLANPQYLKALHLQGQNLNQEVKFDMSFVQGEEEDSKI PVTLGIKKKNL 6 5 

CLACVETEEGPSLQI.EOPVNIMEriYLGAKESKSPTFYRRDMGLTSSFESAAYPGWFLCTV 12 6 

1 + 11+ + l+llll + + +' M I + ++I + lllllll 11+ I 

YLSCVKKGDKPTLQLEMV-DPKKYPKNKEMEKRFVFEKHEIGNKNEFESAAYPNWFISTS 124 



I l + M I I 



mi i 

-GQDITDFQMQ 14 8 



10 



Table 10F. Domain Analysis of NOV10 

gnl | P f am|pf am003 4 0 , IL1, Interleukin-1 / 18. This family includes 
interleukin-1 and interleukin-18 . (SEQ ID NO: 128) 
Length »= 142 residues, 98.6% aligned 

Score « 112 bits (280), Expect = 2e-2 6 



NOV10: 11 



0034 0: 



MKDSALKVLYLHN-NQLIjAGGLHAGKVSLLEKICILPNRGLARTKVPIFLGIQGGSRCLA 69 

+ 1+ lllllll 1+ + + + +I+I+ III I + 1+ 

LNDANQKSLVLANPN YLKALHLNGLNQE VKFDMSFVQG - EPHDSKI PVTLG I SGTNL YLS 5 9 
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NOV10: 70 CVETEEGPSLQLEQPVNIMEL,YLGAKESKSFTFYRRDMGLTSSFESA^YPGWFIjCTVPEA 129 

11+ +17111 " + ++ 1 I I' + ++I 1 1 1 1 1 1 1 1 1+ 'I I 

0034 0: 60 CVKEGDEPVLOLEMV-EPPKYIKNSEMDKRFFFEKTEIGSKVYFESAAYPNWFlATKQEE 118 

I : , f 

NOV10: 13 0 DQPVRLTQLPENGGWNAPITDFYFQQ . ,155 

'l^lli I + Mil 

00340: 119 DRPVFLANGPP ESDITDFQIEE 140 

Interleukin-1 is a cytokine with a wide range of biological and physiological effects, 
including fever, prostaglandin synthesis (in e.g., fibroblasts, muscle and endothelial cells), T- 
lymphocyte activation, and interleukin 2 production. This family is a member of a superfamily 
that also contains the heparin binding growth factors (HBGF), the Kunitz-type soybean trypsin 
inhibitors (STI) and histkctophilin. All have very similar structures, but although the 
inter! diking and HBGF families share some sequence similarity (about 25%), they show none 
at all to the STIs. The interleukhvl family consists of 2 main classes, designated alpha (ILIA) 
and beta (IL1B), as well as the more recently discovered mterleukin 1 receptor antagonist 
(IL1RA). Sequence similarity is high within the ILIA and IL1B subfamilies (about 60-70%) 
but low between them (less than 30%). IL1 As and ILlBs are synthesised as larger precursors, 
which are processed to give mature carboxy fragments. IL1B requires this cleavage to become 
biologically active, but ILIA precursor is already active. Both ILIA and IL1B bind to the 
same IL1 -specific receptor on the target cell, which is then internalised to initiate the relevant 
effects. IL1RA binds to the IL1 receptor, blocking the effects of ILIA and IL1B whilst 
eliciting no response of its own. From sequence comparisons it seems to have arisen by gene 
duplication before IL1 diverged into ILIA and IL1B. The crystal structures of ILIA and IL1B 
have been solved, they share the same 12-stranded beta- sheet structure as both the heparin 
binding growth factors and the Kunitz-type soybean trypsin inhibitors. The beta-sheets are 
arranged in 3 similar lobes around a central axis, 6 strands forming an anti-parallel beta-barrel. 
Several regions, especially the loop between strands 4 and 5, have been implicated in receptor 
binding. 

The cytokine interleukin-1 (IL1) elicits a wide array of biologic activities that initiate 
and promote the host response to injury or infection by activating a set of transcription factors, 
including NFKB and API, which in turn produce production of effectors of the inflammatory 
response. Using a high-throughput cDNA screening technology and BLAST searching, 
followed by additional library screenings, RT-PCR, and 5-prime RACE analysis, Mulero et al. 
(Biochem Biophys Res Commun. 263(3):702-6, 1999) isolated a cDNA encoding a novel 
member of the interleukin-1 family, which they termed IL1HY1. The deduced 155-amino acid 
protein shares 52% sequence identity with IL1RA. It contains 3 of 4 highly conserved cysteine 
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residues and an aspartate at position 148, which is cognate to aspl45 in IL1B and has been 
shown to impart agonist activity to IL1B. It does not contain a signal peptide or a prodomain. • 
PCR analysis revealed expression in leukocytes, spleen, and brain as well as in fetal brain and 
most abundantly in a fetal skin library. RT-PCR analysis also established that IL1HY1 
5 expression is amplified in a stimulated macrophage cell line. 

The above defined information for NOV10 suggests that this NOV10 protein may 
function as a member of a Interleukin-1 protein family. Therefore, the NOV10 nucleic acids 
and proteins of the invention are useful in potential therapeutic and diagnostic applications. 
For example, a cDNA encoding the NOV10 protein may be useful in gene therapy, and the 

10 NOV10 protein may be useful when administered to a subject in need thereof. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for 
' treatment of patients suffering from bone metabolism disorder; proinflammatory disorder; 
immune disorder; inflammatory disease; septic shock; stroke; diabetes; arthritis; intercolitis; 
pneumonitis; epithelial cell; skin disease; proliferative disorder; skin cancer; melanoma; 

15 Kaposi's sarcoma; epithelial cancer; squamous cell carcinoma; bone resorption disorder; 

osteoporosis; Paget's disease; osteoarthritis; degenerative arthritis; osteogenesis imperfecta; 
fibrous displasia; hypophosphatasia; bone sarcoma; myeloma bone disorder; osteolytic bone 
lesion; hypercalcemia; bone mass; bone fragility; bone pain; bone deformity and/or bone 
fracture. The NOV10 nucleic acid encoding Interleukin-1 -like protein, and the Interleukin-1 - 

20 like nrotein of the invention, or fragments thereof, may further be useful in diagnostic 
applications, wherein the presence or amount of the nucleic acid or the protein are to be 
assessed. 

NOV11 

A disclosed NOV11 nucleic acid of 1599 nucleotides (designated CuraGen Acc. No. 
25 CG56093-01) encoding a novel Interleukin-1 signal transducer- like protein is shown in Table 
1 1 A. An open reading frame was identified beginning with an ATG initiation codon at 
nucleotides 6-8 and ending with a TAG codon at nucleotides 1584-1586. Putative 
untranslated regions upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 1 1 A, and the start and stop codons are in bold letters. 

Table 11A. NOY11 Nucleotide Sequence (SEQ IB NO:23) 

^TTACT ATGAGTCTGCTAAACTGTGAAAACAGGTGTGGATCCAGCCAGTCTCAAAGTGACTACTGTGTGGCCATGGCCAG 
CTCCTGTAGCGCAGCAACAAAAAATGATAGTGTGGGCAGAACTGCCAGCACGGGGAACCTCTTCAGCTCCTTTATGGAG 
GAGATCCAGGGATATGATGTAGAGTTTGGCCCACCCCTGGAAAGCAAGTATGAATGCCCCATCTGCTTGATGGCATTAC 
GAGAAGCAGTGCAAACGCCATGCAGCCATAGGTTCTGCAAAGCCTGCATCATAAAATCAATAAGGGATGCAGGTCACAA 
ATATCCAGTTGACAATGAAATACTGCTGGAAAATAAACTATTTCCAGACAATTTTGCAAAACGTGAGATTATTTCTCTG 
ATAGTGAAGTGTCCAAATGAAGGTTGTTTGCACAAGATGGAACTGAGACATCTTGAGGATCATCAAACACATTGTGAGT 
TTGCTCTTATGGATTGTCCCCAATGCCAGCCTCCCTTCCAAAA AT TCCATATTAATATTTACATTATGAAG GGTTGTCC 
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AAGGAGGCAGGTATCTTGTGACAACTGTGCTGCATCAATGGCATTTGAAGATAAAAAGATCCATGACCAGAACTGCCCT 
TTGGCAAATGTCATCTGTGAGTACTGCAATACrTATACTCATCAGAGAACAGATGCCTAACTTTATGACCTATGACCTAG 
ACCGCCCTACAGCCCCAATTCCATGCACATTCCGTACTTT^ 

CCACCTACAAGAG AACACCCAATCAAACATGAG AATGTTGG CCCAGGCTGTTCATAGTTTGAG CCTTATACCCGACTCT 
GGGTATATCTCAGAGGTCCGGAATTTCCAGGAAAGTATTCACCAGTTAqAGGGTCGCCTTG ( TAAGACAATGTCATCAAA 
TCCGGGAGCTG ACTGATAAAATGG AAACTCAG AGTATGTATGTAAGTGAG CT CAAACG AACC ATTCGAACCCT TGAGGA 
CAAAGTTGCTGAAATTGAAGCACAGCAGTGCAATGGAATTTACATTTGGAAGATTGGGAATGAATTTGAAATGTTAAGA 
AGAGCAGAAACCGAAGAGCAGAAACCTGTTGTGATTCATAGCCCTGGATTCTACACGGGTAAACACAGGTACAAACTGT 
GCATGCGCTTGCACCTTCCGTTACCGACTGCTCAGCGCTGTGCAAACTATATATCCCTTTTTGTCCACACAATGCAAGG 
AGAGTATGACAGCCACCTCCCTTGGCCCTTCCAGGATACAATATGCCTTACAATTCTTGATCAGTCTCAAGCACCTGTA 
AGGCAAAACCACGAAGAGATAATGGATGCCAAACCAGAGCTGCTTGCTTTCCAGCGACCCACAATCCCACGGAACCCAA 
AAGGTTTTGGCTATGTAACTTTTATGCATCTGGAAGCCCTAAGACAAAGAACTTTCATTAAGGATGACACATTATTAGT 
GCACTGTGAGGTCTCCACCCGCTTTGACATGGATAGCCTTCAGAGGGAGGGTTTTCAGCCACAAAGTACTGATGCAGGG 
GTATAGCTTGCCCTCACTT 



10 



15 



The nucleic acid sequence of NOV1 1 maps to chromosome 10 and has 712 of 856 
bases (83%) identical to a Mus musculus TRAF6 nxRNA (gb:GENBANK- 
rD:D84655|acc:D84655.1) (E = 3.0e' 242 ). 

A NOV1 1 polypeptide (SEQ ID NO:24) encoded by SEQ ID NO:23 is 526 amino acid 
residues and is presented using the one letter code in Table 1 IB. Signal P, Psort and/or 
Hydropathy results predict that NOV1 1 is likely to be localized extracellularly with a certainty 
of 0. 5050. 



Table 11B. NOV11 protein sequence (SEQ ID NO:24) 



MSLLNCENRCGSSQSQSDYCVAMASSCSAATKNDSVGRTASTGNLFSSFMEEIQGYDVEFGPPLESKYECPICLMALREAVQT 
PCSHRFCKACI I KS I RDAGHKY P VDNEI LLEN KLFPDN FAKRE 1 1 SLI VKCPNEGCIiHKMELRHLEDHQTHCEFALMDCPQCQ 
PPFQKFHINiyiMKGCPRRQVSCDNCAASMAFEDKKIHDONCPIiANVlCEYCNTILIREQMPNFMTYDLDRPTAPIPCTFRTF 
RSHEKMRRNHIARHLQENTQSNMRMLAQAVHSLSLIPDSGYISEVRNFQESIHQLEGRLVRQCHQIRELTDIO^ETQSMYVSEIj 
KRTlRTLEDKVAEIEAQQCNGIYIWKIGNEFEMLRRAETEEQKPWIHSPGFYTGKHRYKliCMRLHLPLPTAQRCANYISLFV 
HTMQGEYDSHIiPWPFQDTICLTIIjDQSQAPVRQNHEEIMDAKPELLAFQRPTI PRNPKGFGYVTFMHLEAIjRQRTFIKDDTLIj 
VHCE VSTRFDMD SLQREG FQ PQSTD AG V 



The NOV1 1 amino acid sequence has 475 of 526 amino acid residues (90%) identical 
to, and 493 of 526 amino acid residues (93%) similar to ? a Homo sapiens 522 amino acid 
residue putative Interleukin 1 signal transducer (ptnr : SPTREMBL- ACC : Q9 Y4K3 ) (E = L2e" 



260 



NOV1 1 also has homology to the amino acid sequences shown in the BLASTP data 
listed in fable 11 C. 



Table 11 C. BLAST results for NOV11 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


qi I 4759254 j ref |NP 0 


TNF receptor- 
associated factor 
6 [Homo sapiens] 


522 


475/526 
(90%) 


493/526 
(93%) 


0.0 


04611. 1| 
(NM 004620) 


qi|7513262|pir| |S71 
821 


probable 
interleukin 1 
signal - 
transducing 
protein TRAF6 
[Homo sapiens] 


522 


474/526 
(90%) 


492/526 
(93%) 


0.0 


qi | 6678429 | ref |NP o 


Tnf receptor- 
associated factor 
6 [Mus musculus] 


530 


426/534 
(79%) 


467/534 
(86%) 


0.0 


33450. 1| 
(NM__009424) 
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gi| 17472113 Jref |XP 
061503.1 I 
(XM__061503) i> 


similar to TNF 
receptor- 
associated factor 
6 [Homo sapiens] 


399 


235/271 
(86%) 


236/271 
(86%) 


e-131 


gi |17472115 | ref |XP- 


similar to TNF 
receptor- , , 
associated factor 
6 [Homo sapiens] 


225: 


*" 114/115 
(99%) 


116/115 • 
(99%) 


4e-60 


061504 .1 | 
(XMJ061504) 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 1 ID. 





90 
I 
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Table 11D ClustalW Analysis of NOV11 

1) NOV 11 (SEQIDNO:24) 

2) ei l4759254lref|NP 0046>1.1| (NM_004620) TNF receptor-associated factor 6 [Homo sapiens] (SEQ ID 
NO: 129) 

3) gi|7513262ipirllS7l82l probable interleukin 1 signal-transducing protein TRAF6 [Homo sapiens] (SEQ ID 
NO: 130) 

4) gj |6678429|ref| NP _033450. 1 1 (NM_009424) Tnf receptor-associated factor 6 (Mus musculus] (SEQ ID 
NO:131) 

5) gi 1 1 7472 3 1 3 [reflXP 06 1 503 . 1 1 (XM_061503) similar to TNF receptor-associated factor 6[Homo sapiens] 
(SEQ ID NO: 132) i • 

6) gill 747211 5 IreflXP 061504.1! (X!M_061504) similar to TNF receptor-associated factor 6 JfHomo sapiens] 
(SEQ ID NO: 133) , 



NOVll 

gi |4759254 j 
gi|7513262 j 
gi|6678429| 
gi|l7472113 | 
gi|l7472115| 



NOVll 

gi (4759254 | 
gij 7513262 j 
gi|6678429 j 
gi|l7472113 | 
gi (17472115 j 



NOVIj 

gi|4759254 j 
gi|7513262 j 
gi j 6678429 j 
gi (17472113 | 
gij 17472115 j 



NOVll 

gi|4759254 | 
gi|75l3262| 
gi |6678429j 
gi | 17472113 | 
gi|l7472115 j 



Novia 

gi|4759254[ 
gi|75l3262 j 
gij6678429| 
gi|l7472113| 
gi|l7472115 j 



'CKACIIKSIRDAGH 
'CKACIIKSIRDAGH 
'CKACI IKS IRDAGH 
'CKACXIKSIRDAGH 




J. 



KMBLRHIiEDHOHHCEFAlJSpCPQCQ 3PFQ 

kmelrhledh^cefaiSdcpqcc 3pfq 
kmelrhledh^hcefaiJdcpqcq Spfqi 

BKMELRHLEDHOfflHCEFAlfiScPQCQ 3PFQ 





I^LL^i]^isv2LDTYS2KSNCHLECViiPgW^;E 
320 330 340 350 




SHPYYgDEI LRNSVS SAVBjVLSgjHHSHHCFR 
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NOV11 

9i)4759254 | 



gi 



7513262 j 
6678429] 
17472113 | 
17472115| 



NOVll 

91 (4759254 | 
gi j 7513262 | 
gi[ 6678429] 
gijl7472113] 
gi j 17472115 I 



NOVll 

gi I 4759254 j 1 
gi j 7513262 | 
gi j 6678429 j 
gi j 17472113 | 
gij 17472115 j 



NOVll 

gi| 4759254 ] 
gi|7513262| 
gi j 6678429 j 
gi I 17472113 I 
gij 17472115] 



gx 
gi 
gi 
gi 
gi 



4759254 I 
7513262 j 
6678429| 
17472113} 
17472115 j 




560 



g YTEKGLDGPELKLGCTELLSKKKQT 
?HKY- P VDNEI 



570 



58,0 . 



600 
..|.. 



LMQKSIPHTSPHKHVSWAPVPLSPKIVCIYWALILQSNEVTITEDKFNNIjIKAAAVTVEPFWPSFFAKA 
LLEN 



650 



660 



670 



680 



690 



LASVNlGSLLCNAGVGRWLQQPAPHKQEVLPLPPIiLPQLRRGKWKQKKKNIiRSLTRTWALVFLTKLVL 



Tables 1 IE - 1 1H list the domain description from DOMAIN analysis results against 
NOV1 1. This indicates that the NOV1 1 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table HE. Domain Analysis of NOVll 

gnl 1 Pf am} pf 80100917 , MATH, MATH domain. This motif has been called the 
Meprin And TRAF -Homology (MATH) domain. This domain is hugely expanded 
in the nematode C. elegans. (SEQ ID NO: 134) 
Length = 116 residues, 91.4% aligned 

Score ~ 66.2 bits (160), Expect 4e-12 



NOVll: 


375 


KPWIHSPGFYTGKHRYKLCMRLHLPLPTAQRCANYISLFVHTMQGEYDSHLPWPFODTI 


434 






| + + +|++ | ++ |++| ++| | | |+| | + 




00917: 


9 


KEGEE Y YTS PVEERFGI PWRLRIY RNGGFLGLYLHCLKGEKDSNLKWS I EAEF 


61 


NOVll : 


435 


CLTI LDQSQAPVRQNHEE I MDAKPELIjAFQRPTI PRNPKGFGYVTFMHLEATjRQRTFIKD 


494 






|+++ +++++ |||+ |+ + | ++ | 




00917 : 


62 




105 


NOVll : 


495 


DTLLVHCEV 5 03 








III++ II 




00917 : 


106 


DTLIIEAEV 114 
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Table 11F. Domain Analysis of NO VI 1 

gnl 1 Pf am 1 pf am02176 , zf-TRAF, TRAF- type zinc finger.' (SEQ ID NO: 135) 
Length = 57 residues, 100.0% aligned 

Score = 37.0 bits (84), Expect = 0.003 ' 



NOVll: 151 HQTHCEFA1/MDCPQ-CQPPFQKFHINIYIMKGCPRRQVSCDNCAASMAFEDKKIHDQ 206 

1+ II +11 I ++.++.11+111 + + + I 

02176 : 1 HEKTCPFVPVPCPNKCGKKILREDLPDHLSADCPKRPVPCPFKVYGCKVDMVRENLQ 57 



Table 11 G. Domain Analysis of NOVll 

gnl | Smart 1 smart00061 , MATH, meprin and TRAF homology (SEQ ID NO:i36) 
Length = 100 residues, 98.0% aligned 

Score ~ 39.3 bits (90), Expect - 6e-04 



10 



NOVll: 


356 


00061 : 


3 


NOVll: 


416 


00061: 


46 


NOVll: 


476 


00061: 


90 



SHTFKN VSKFEEGES- 



II + + II + 

-YFSPSEE HFNI PWRLKI - 



I ++II++ 
-YRKNGFLSLYL 4 5 



I +111 I + I ++ 1+ 

HCEKE ENDS - RKWS I E AEFTLKLVSQNGKSLS KKDTHV - 



| ++ | | + 
-FEKPG GW 89 



1+ 1+ + I 
GFSKFISWDDL 10 0 



Table 11H. Domain Analysis of NOVll 

gnl 1 Smart \ smart00184 , RING, Ring finger; E3 ubiquitin-protein ligase 
activity is intrinsic to the RING domain of c-Cbl and is likely to be 
a general function of this domain; Various RING fingers exhibit 
binding activity towards E2 ubiquitin-conjugating enzymes (Ubc ' s) 
(SEQ ID NO:137) 

Length = 41 residues, 87.8% aligned 

Score » 35.4 bits (80), Expect = 0.008 



NOVll: 70 CPI CLMAL - REAVQTPCSHRFCKACI I KS I RDAGHK 104 

Mill ++ I III II++II i + + 

00184: I CPICLEEYLKDPWLPCGHTFCRSCIRKWLESSNSN 36 



Many cytokines signal through different cell-surface receptors to activate the 
1 5 transcription factor NF-kappaB. Members of the TRAF protein family have been implicated in 
the activation of NF-kappaB by the tumour-necrosis factor (TNF)-receptor superfamily. Here 
we report the identification of a new TRAF family member, designated TRAF6. When 
overexpressed in human 293 cells, TRAF6 activates NF-kappaB. A dominant-negative mutant 
of TRAF6 inhibits NF-kappaB activation signalled by interleukin-1 (IL-1) but not by TNF. IL- 
20 1 treatment of 293 cells induces the association of TRAF6 with IRAK, a serine/threonine 
kinase that is rapidly recruited to the IL-1 receptor after IL-1 induction. These findings 
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indicate that TRAF proteins may function as signal transducers for distinct receptor families 
and that TRAF6 participates in IL-1 signalling. The transcription factor NF-kappa-B is 

I : , I' ' 

activated by many cytokines that signal through different cell surface receptors. Members of 
the TRAF protein family have been implicated in the activation of this transcription factor by 
5 the tumor necrosis factor (TNF) superfamily. TRAF2 is required for activation of this 

transcription factor by 2 TNF receptors, TNFR1 and TNFR2 , as well as CD40, and TRAF5 
may be responsible for NF-kappa-B activation signaled by the lymphotoxin B receptor . Qao et 
al. (Nature. 383(6599):443~6, 1996) identified a new member of the TRAF family, designated 
TRAF6. When overexpressed in cultured human cells, TRAF6 activates NF-kappa-B. A 

10 dominant-negative mutant of TRAF6 inhibited this activation signaled by interleukin-1 . ILIA 
treatment of the same cells induced the association of TRAF6 with interleukin-1 -associated 
kinase, a serine/threonine kinase that is rapidly recruited to the receptor after ILIA 
induction. The findings were interpreted as indicating that TRAF proteins function as signal 
transducers for distinct receptor families in that TRAF6 participates in ILIA signaling. 

15 TRANCE (or RANKL), a TNF family member, and its receptor, RANK, are critical regulators 
of dendritic cell and osteoclast function. Wong et al. (Mol Cell. 4(6): 1 04 1 -9, 1 999) 
demonstrated that TRANCE activates the antiapoptotic serine/threonine kinase PKB through a 
signaling complex involving SRC and TRAF6: A deficiency in SRC or addition of SRC 
family kinase inhibitors blocked TRANCE-mediated PKB activation in osteoclasts. SRC and 

20 TRAF6 interacted with each other and with RANK upon receptor engagement. TRAF6, in 

turn, enhanced the kinase activity of SRC, leading to tyrosine phosphorylation of downstream 
signaling molecules such as CBL . These results defined a mechanism by which TRANCE 
activates SRC family kinases and PKB, and provided evidence of cross-talk between TRAF 
proteins and SRC family kinases. TRAF6 is a signal transducer in the NF-kappa-B pathway 

25 that activates I-kappa-B kinase in response to proinflammatory cytokines. Deng et al. 

(CelL;103(2):351-61, 2000) purified a heterodimeric protein complex that links TRAF6 to 
IKK activation. Peptide mass fingerprinting analysis revealed that this complex is composed 
of the ubiquitin conjugating enzyme UBC13 and the UBC-like protein UBE2V1. They found 
that TRAF6, a RING domain protein, functions together with UBC13/UBE2V1 to catalyze the 

30 synthesis of unique polyubiquitin chains linked through lysine-63 (K63) of ubiquitin. 

Blockade of this polyubiquitin chain synthesis, but not inhibition of the proteasome, prevents 
the activation of IKK by TRAF6. These results unveil a new regulatory function for ubiquitin, 
in which IKK is activated through the assembly of K63-linked polyubiquitin chains. 
Takayanagi et al. (Nature. 408(68 12):600-5, 2000) demonstrated that T-cell production of 
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interferon-gamma strongly suppresses osteoclastogenesis by interfering with the RANKL- 
RANK signaling pathway. IFNG induces rapid degradation of the RANK adapter protein, 
TRAF6, resulting in strong inhibition of the RANKL-induced activation of the transcription 
factor NFKB and INK. This inhibition of osteoclastogenesis could be rescued by 
5 overexpressing TRAF6 in precursor cells, indicating that TRAP6 is the target critical for the 
IFNG action. Furthermore, Takayanagi et al. (20,00) provided evidence that the accelerated 
degradation of TRAF6 requires both its ubiquitination, which is initiated by RANKL, and 
IFNG-induced activation of the ubiquitin-proteasome system. Takayanagi et al. (2000) 
concluded that their study showed that there is crosstalk between the tumor necrosis factor and 

10 tFN families of cytokines, through- which IFNG provides a negative link between T-cell 
activation and bone resorption. 

The above defined information for NOV1 1 suggests that this NOV1 1 protein may 
function as a member of a Interleukin-1 signal transducer protein family. Therefore, the 
NOV1 1 nucleic acids and proteins of the invention are useful in potential therapeutic and 

1 5 diagnostic applications. For example, a cDNA encoding the ( NOVl 1 protein may be useful in 
gene therapy, and the NOV1 1 protein may be useful when administered to a subject in need 
thereof By way of nonlimiting example, the compositions of the present invention will have 
efficacy for treatment of patients suffering from bone metabolism disorder; proinflammatory 
disorder; immune disorder; inflammatory disease; septic shock; stroke; diabetes; arthritis; 

20 intercolitis; pneumonitis; epithelial cell; skin disease; proliferative disorder; skin cancer; 
melanoma; Kaposi's sarcoma; epithelial cancer; squamous cell carcinoma; bone resorption 
disorder; osteoporosis; Paget's disease; osteoarthritis; degenerative arthritis; osteogenesis 
imperfecta; fibrous displasia; hypophosphatasia; bone sarcoma; myeloma bone disorder; 
osteolytic bone lesion; hypercalcemia; bone mass; bone fragility; bone pain; bone deformity 

25 and/or bone fracture. The NOV1 1 nucleic acid encoding Interleukin-1 signal transducer-like 
protein, and the Interleukin-1 signal transducer-like protein of the invention, or fragments 
thereof, may further be useful in diagnostic applications, wherein the presence or amount of 
the nucleic acid or the protein are to be assessed. 

NOV12 

30 A disclosed NOV12 nucleic acid of 1 097 nucleotides (designated CuraGen Acc. No. 

CG561 38-01) encoding a novel Olfactory receptor/ G-Protein Coupled Receptor-like protein 
is shown in Table 12A. An open reading frame was identified beginning with an ATG 
initiation codon at nucleotides 5-7 and ending with a TAG codon at nucleotides 959-961. 
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Putative untranslated regions upstream from the initiation codon and downstream from the 
termination codon are underlined in Table 12 A, and the start and stop codpris are in bold 
letters. 1 1 r 



Table 12A. NOV12 Nucleotide Sequence (SEQ ID NO:25) 

AGTC ATGTGCTCAGGGAATCAGACTTCTCAGAATCAAACAGCAAGCACTGATTTCACCCTCACGGGACTCTTTGCTGAG 

AGCAAGCATGCTGCCCTCCTCTACACCGTGACCTTCCTTCTTTTCTTGATGGCCCTCACTGGGAATGCCCTCCTCATCC 

TCCTCATCCACTCAGAGCCCCGCCTCCACACCCCCATGTACTTCTTCATCAGCCAGCTCGCGCTCATGGATCTCATGTA 

CCTATGCGTGACTGTGCCCAAGATGCTTGTGGGCCAGGTCACTGGAGATGATACCATTTCCCCGTCAGGCTGTGGGATC 

CAGATGTTCTTCTACCTGACCCTGGCTGGAGCTGAGGTTTTCCTCCTGGGTGCCATGGCCTATGACCGATATGCTGCTG ' 

TTTGCAGACCTCTCCATTACCCACTGCTGATGAACCAGAGGGTGTGCCAGCTCCTGGTGTCAGCCTGCTGGGTTTTGGG 

AATGGTTGATGGTTTGTTGCTCACCCCCATTACCATGAGCTTCCCCTTTTGCCAGTCTAGGAAAATCCTGAGTTTTTTC 

TGTGAGACTCCTGCCCTGCTGAAGCTCTCCTGCTCTGACGTCTCCCTCTATAAGACGCTCATGTACCTGTGCTGCATCC 

TCATGCTTCTCGCCCCCACCATGGTCATCTCCAGCTCATACACCCTCATCCTGCATCTCATCCACAGGATGAATTCTGC 

CGCCGGCCACAGGAAGGCCTTGGCCACCTGCTCCTCCCACATGATCATAGTGCTGCTGCTCTTCGGTGCTTCCTTCTAC 

ACCTACATGCTCCCGAGTTCCTACCACACAGCTGAGCAGGACATGATGGTGTCTGCCTTTTACACCATCTTCACTCCTG 

TGCTGAACCCCCTCATTCACAGTCTCCGCAACAAAGATGTCACCAGGGCTCTGAGGAGCATGATGCAGTCAAGAATGAA 

CCAAGAAAAGTAG TAAAGGGCAAGCATTGTCCCCTCCTCTTTCTATAATTCCGTTACTCCCTATCTCTCCTCTCTTTTG 

CCCTCAGGTCTCCGGGTCCCCAGCACAAAGCCCACTCATATTTTCCTTCTTTCTTATACGTGGCGTTTTC 



The nucleic acid sequence of NOV12 hks 548 of 903 bases (60%) identical to a Galhis 
gallus cor4 DNA for olfactory receptor 4 mRNA (gb:GENBANI£- 
ID:GGCOR4GEN|acc:X94744.1) (E = 2.5^ 1 ), 

A NOV12 polypeptide (SEQ ID IvfO:26) encoded by SEQ ID NO:25 is 318 amino acid 
residues and is presented using the one letter code in Table 12B. Signal P, Psort and/or 
Hydropathy results predict that NOV 12 contains a signal peptide and is likely to be localized 
at the plasma membrane with a certainty of 0.6000. The most likely cleavage site for a NOV 12 
peptide is between amino acids 56 and 57, at: IHS-EP. 



Table 32B. NOV12 protein sequence (SEQ ID NO:26) 



MCSGNQTSQNQTASTDFTLTGLFAESKHAALLYTVTFLL^ 
PKMLVGQVTGDDTISPSGCGIQMPFYLTIiAGAEVFLLAAM^ 

ITMSFPFCQSRKILSFFCETPALLKLSCSDVSLYKTLMYLCCILMLIiAPTMVI SS SYTLI LHLI HRMNSAAGHRKALATCSSH 
Mil VIiLLFG ASFYTYMLPS S YHTAE QDMMVSA F YT I FT P VLNPLI H S LRNKD VTRA LR SM M Q S RMNQEK 



The NOV12 amino acid sequence has 142 of 297 amino acid residues (47%) identical 
to, and 194 of 297 amino acid residues (65%) similar to, a Mus musculus 316 amino acid 
residue T2 olfactory receptor (ptnr :TREMBLNEW- ACC : AAG45 1 96) (E = 9.3e 73 ). 

NOV12 is expressed in at least the following tissues: Apical microvilli of the retinal 
pigment epithelium, arterial (aortic), basal forehrain, brain, Burkitt lymphoma cell lines, 
corpus callosum, cardiac (atria and ventricle), caudate nucleus, CNS and peripheral tissue, 
cerebellum, cerebral cortex, colon, cortical neurogenic cells, endothelial (coronary artery and 
umbilical vein) cells, palate epithelia, eye, neonatal eye, frontal cortex, fetal hematopoietic 
cells, heart, hippocampus, hypothalamus, leukocytes, liver, fetal liver, lung, lung lymphoma 
cell lines, fetal lymphoid tissue, adult lymphoid tissue, tissues that express MHC II and III, 
nervous, medulla, subthalamic nucleus, ovary, pancreas, pituitary, placenta, pons, prostate, 
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putamen, serum, skeletal muscle, small intestine, smooth muscle (coronary artery in aortic) 
" spinal cord, spleen, stomach, taste receptor cells of the tongue, testis, thalamus, and thymus 
tissue. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
5 sources, Literature sources, and/or RACE sources. 

NOV12 also has homology to the amino acid sequences shown in the BLASTP data 



listed in Table 12C. 



Table 12C. BLAST results for NOV12 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
<%) 


Positives 
(%) 


Expect 


qi 1 17437056 | ref jXP 


similar to 
OLFACTORY 
RECEPTOR 2T1 
( OLFACTORY 
RECEPTOR 
1-25) (OR1-25) 
[Homo sapiens] 


695 


308/312 
(98%) 


311/312 
(98%) 


e-153 


060314 .2 | 
(XM_060314) 


gi | 17437027 | ref |XP 


similar to 
olfactory 
receptor [Homo 
sapiens] 


318 


310/318 
(97%) 


312/318 
(97%) 


e-147 


060309.1 i 
(XM_060309) 


gi j 17437021 1 ref |XP 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 
1-25) (OR1-25) 
[Homo sapiens] 


365 


203/301 
(67%) 


238/301 
(78%) 


e-102 


060308 . 1 | 
(XM_060308) 


gi| 17437059|ref |XP 


similar to 
OLFACTORY 
RECEPTOR 2T1 
(OLFACTORY 
RECEPTOR 
1-25) (OR1-25) 
[Homo sapiens] 


348 


202/301 
(67%) 


237/301 
(78%) 


e-101 


060315 . 1 | 
(XMJD60315) 


qi (15293809 jgb|AAK9 


olfactory 
receptor [Homo 
sapiens] 


217 


211/217 
(97%) 


212/217 
(97%) 


2e-97 


5097.1 | (AF399612) 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
10 in Table 12D. 



Table 12D ClustalW Analysis of NOV12 

1) NOV12(SEQIDNO:26) 

2) gi |174370561ref1XP 060314. 1 [ (XM_060314) similar to OLFACTORY RECEPTOR 2T1 (OLFACTORY 
RECEPTOR 1-25) (OR1-25) [Homo sapiens] (SEQ LD NO: 1 38) 

3) gill7437027lref]XP 06 0309. 1| (XM 060309) similar to olfactory receptor [Homo sapiens] (SEQ ID 
NO; 139) 

4) gj ]1 743 7 02 llreflXP 060308.1) (XMJ)60308) similar to OLFACTORY RECEPTOR 2T1 (OLFACTORY 
RECEPTOR 1-25) (OR1-25) [Homo sapiens] (SEQ ID NO: 140) 

5) gi|17437059lr eflXP_060315.1| (XM_060315) similar to OLFACTORY RECEPTOR 2T1 (OLFACTORY 
RECEPTOR 1-25) (OR1-25) [Homo sapiens] (SEQ ID NO:141) 

6) gi|15293809lgblAAK95097.1| (AF399612) olfactory receptor [Homo sapiens] (SEQ ID NO: 142) 

10 20 30 40 50 60 70 
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NOV12 

gi 17437056 [ 
gi a?437027| 
gi 17437021) 
gi 17437059| 
gi 15293809) 



NOV12 

gi [17437056 j 
gi| 17437027 | 
gi|l7437021 | 
gi j 17437059 j 
gi|l5293809| 



NOV12 

gi [17437056 | 
gi|l7437027| 
gi | 17437021 j 
gi |17437059| 
gijl5293809 j 



NOV12 

gi|l7437056 | 
gijl7437027j 
gi [ 17437021 j 
gi |17437059| 
gijl5293B09( 



NOV12 

gi[l7437056 | 
gi | 17437027 | 
gi 17437021] 
gi [17437059 j 
gi|l5293809j 



NOV12 

gi 17437056) 
gi 17437027 | 
gi 17437021J 
gi 17437059J 
gi 15293809| 



NOV12 

gi [ 17437056 ] 
gi |17437027 j 
gi 17437021| 



9i 
9i 



17437059 j 
15293809 j 



NOV12 



9^ 
gi 
gi 
gi 
gi 



17437056 
17437027 
17437021 
17437059 
15293809 



NOV12 

gij 17437056 [ 
gi|l7437027 j 
gi|l743702l| 



I I I I i J I - - - - I 

,„.---- MC^gNQTSQNQSASTg 

LI J MCsSjQTSQNQMASTlJ 

Mcjs^QTSQNQHASTjj 

MDNITWI^SHTGWSDFILMGLFRQSKHP^feI l TWMAN}iSGW^S 




110 



130 




CETPALLKLSCSDVSLY 



300 
..|.. 



330 



340 



LLLFGASPYTYMLPSSYHTAEQDMMVSAFVTIFTP 



PIiXgjSLREKD 



EYTYMLPSSYIHl 
□YTYMLPSSYHT 



.YtnQEBDMMVSiBgYTlBTPVlSl 

;yht||e2dmmvsB f vt i Bf p ^®: 




js^MQSR- -FlN^E!5 

[qa ^eqsnysv 

Iqsr-- mnIqeS — 

!t 



liLLFGASFYTYMLPSSYHTAEQDMMVSAFYTI FTP 



360 



370 

..|....|. 



380 



390 



400 



410 



420 
.|....| 



YABFILLGLFSNARFPWLLFALILliVFLTSIASNVVKIIlL.IHIDSRLHTPMYFLL.SQljSLRDIIjYISTIV 
SVFCTVLTPMLNP 



- LI YI IjRNKDWGLFRKFWEHI KSLjNRTHKYQCGKQR- 
„ VEPAFQKAME - 



440 
.|.-.-|.. 



480 



490 
••I 



PKMLVDQVMSQRAISFAGCTAQHFLYLiTLAGAEFFLLGLMSYDRYVAICNPLHYPVLMSRKICWLIVAAA 



510 



520 



530 

WIiGGSir^FLLTPVTMQFPFCASREINHFFCEVPALLKLSCrDTSAYETAMYVCCIMMLIilPFSVISGSY 



540 



560 



570 

■ I — i — i — i — i — i — i — i — i — i — i — i • • 

TR1 LITVYRKSEAEGRGKAVATCS SHMVWSLF YGAAMYTYVLPHS YHTPEQDKAVSAF YTI LTPMLNPL 



580 

..|.. 



590 
..|.. 



630 
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gi|l7437059| 
gi |l5293809j 



NOV12 

gi|l7437056 
gi|l7437027 
gijl7437021 
gi|l7437059 
gi|l5293809 



NOV12 



gi 
gi 
gi 
gi 



17437056 
17437027 
17437021 
17437059 
15293809 



640 



650 
..|.. 



660 



670 



680 
..I.. 



690 



700 



-I | . - | - 

IYSLRNKDVTGALQKVVGRMEWKTLPFQALQVRCVKWRRSVIjVSSFIATERTLADTSHSSSHAEFPERGV 



710 

..I.. 



RMNCSKIiFSLVEEPVTSLGDLPNFR 



Table 12E lists the domain description from DOMAIN analysis results against 
NOV12. This indicates that the NOV12 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 12E. Domain Analysis of NOV12 

gnl 1 Pf am| pf amOOOOl , 7tm_l / 7 transmembrane receptor (rhodopsin 

family). (SEQ ID N0:143) 

Length = 254 residues, 99.6% aligned 

Score = 107 bits (268) , Expect = 8e-25 



NOV12: 46 GNALLILLIHSEPRLHTPMYFFISQLALMDLMYLCVTVPKMliVGQVTGDDTISPSGCGIQ 105 

GN L+IL+I +L TP F+ LA+ DL++L PL V GD + C + 

00001 : 1 GNLLVILVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWVFGDALCKLV 60 

NOV12: 106 MFFYLTLAGAEVFLLAAMAYDRYAAVCRPLHYPLLMNQRVCQLLVSACWVLGMVDGLLLT 165 

++ A + LL A+ + DRY A+ PL Y + R ++L+ WVL ++ 

00001 : 61 GALFWNGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILLVWVLALL 114 



NOV12 : 166 PITMSFPFCQSRKILSFFCETPALLKLSCSDVSLYKTLMYLCCILMLLAPTMVISSSYTL 225 

+S P + + + + + S+ ++ + L ++ + P +VI YT 

00001 : 115 - - -LSLPPLLFSWLRTVEEGNTTVCLIDFPEESVKRSYVLLSTLVGFVLPLLVILVCYTR 171 

NOV12 : 226 ILHLIHR MNSAAGHRKALATCSSHMI 1VLLLFG ASFYTYMLPSSYHT 272 

IL + + ++ RKA ++ + +L + + L S + 

00001: 172 ILRTLRKRARSQRSLKRRSSSERKAAKMLLVWWFVLCWLPYHIVLLLDSLCLLSIWRV 231 

NOV12: 273 AEQDMMVSAFYTI FTPVLNPLI 294 

++++ + LNP+I 
00002: 232 LPTALL I TLWLAYVNSCLNPI I 2 53 



G-Protein Coupled Receptors (GPCRs) have been identified as an extremely large 
family of protein receptors in a number of species. At the phylogenetic level they can be 
classified into four major subfamilies. These receptors share a seven transmembrane domain 
1 0 structure with many neurotransmitter and hormone receptors. They are likely to be involved 
in the recognition and transduction of various signals mediated by G-Proteins, hence their 
name G-Protein Coupled Receptors. The human GPCR genes are generally intron-less and 
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belong to four gene subfamilies, displaying great sequence variability. These genes are 
dominantly expressed in olfactory epithelium. 

Olfactory receptors (ORs) have been identified as an extremely large family of GPCRs 
in a number of species. As members of the GPCR family; these receptors share a seven 
5 transmembrane domain structure with many neurotransmitter and hormone receptors, and are 
likely to underlie the recognition and G-protein-mediated transduction of odorant signals. Like 
GPCRs, the ORs can be expressed in a variety of tissues where they are thought to be involved 
in recognition and transmission of a variety of signals. The human OR genes are typically 

intron-less and belong to four different gene subfamilies, displaying great sequence variability. 

i ... • 

10 These genes are dominantly expressed in olfactory epithelium. 

The above defined information for NOV 12 suggests that this NOV 12 protein may 

1 function as a member of a Olfactory receptor/ G-Protein Coupled Receptor protein family. 

Therefore, the NOV 12 nucleic acids and proteins of the invention are useful in potential 

therapeutic and diagnostic applications. For example, a cDNA encoding the NOV12 protein 

1 5 may be useful in gene therapy, and the NOV12 protein may be useful when administered to a 
subject in need thereof. By way of nonlimiting example, the compositions of the present 
invention will have efficacy for treatment of patients suffering from developmental diseases, 
MHCII and III diseases (immune diseases), taste and scent detectability disorders, Burkitt's 
lymphoma, corticoneurogenic disease, signal transduction pathway disorders, retinal diseases 

20 including those involving photoreception, cell growth rate disorders, cell shape disorders, 
feeding disorders, potential obesity due to over-eating, potential disorders due to starvation 
(lack of appetite), noninsulin-dependent diabetes mellitus (NIDDM1), bacterial, fungal, 
protozoal and viral infections (particularly infections caused by HIV-1 or HIV-2), pain, cancer 
(including but not limited to neoplasm; adenocarcinoma; lymphoma; prostate cancer; uterus 

25 cancer), anorexia, bulimia, asthma, allergies, Parkinson's disease, acute heart failure, 

hypotension, hypertension, urinary retention, osteoporosis, Crohn's disease, multiple sclerosis, 
Albright hereditary osteodystrophy, angina pectoris, myocardial infarction, ulcers, benign 
prostatic hypertrophy, psychotic and neurological disorders (including anxiety, schizophrenia, 
manic depression, delirium, dementia, and severe mental retardation), dentatorubro- 

30 pallidoluysian atrophy (DRPLA), hypophosphatemic rickets, autosomal dominant (2) 
acrocallosal syndrome and dyskinesias, such as Huntington's disease and/or Gilles de la 
Tourette syndrome. The NOV12 nucleic acid encoding Olfactory receptor/ G-Protein Coupled 
Receptor-like protein, and the Olfactory receptor/ G-Protein Coupled Receptor-like protein of 
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the invention, or fragments thereof, may further be useful in diagnostic applications, wherein 
the presence or amount of the 'fiucleic acid or the protein are to be assessed. ■ 

NOV13 ' ' 



A disclosed NOV13 nucleic acid of 1606 nucleotides (designated CuraGen Acc. No. 
CG56097-01) encoding a novel glucuronosyltransferase-like protein is shown in Table 13A. 
An open reading frame was identified beginning with an ATG initiation codon at nucleotides 
1-3 and ending with a TAG codo'n at nucleotides 1 600-1 602. A putative untranslated region 
downstream from the termination codon is underlined in Table 13 A, and the start and stop 
codons are in bold letters. 



Table 13A. NOV13 Nucleotide Sequence (SEQ ID NO:27) 



ATGGCTATGAAATGGACTTCAGTCCTTCTGTTGATACAGCTGAGCTATTACTCTAGCTCTGGGAGTTGTGGAAATGTGC 
CGCTGTGGCCCATGGAATATAGTCCTTGGATGAATATAAAGACAATCCTGGATAAACTTATGC*AGATAAGTCATGAGGT 
GACTGTTCTAACATTGTCAGCTTCCATTCTTGTTGATCCCAACATAACATCTGTTACTAAATTTGAGGTTTATTCTATA 
TCTGTAATTAAAGATGATTTTGCAGGGTTTTTTTTCACACAACAGATTACTAAATGGATACATGATCTTCCAAAACATA 
TATTTTGGTTTAAATGTGTTCCCTTCAAGAATATTCTTTGGGAATATTCTGGTTATACTGAGAAGTTCTTTAAAGATGT 
AGTTTTGAACAAGAAACTTATGACAAACCTACAAGAATCAAGGTCTGATGTCGTTCATGCAAATGCCATTGGTCCCTTT 
GGAGAGCTGCTGGCTGAGCTATTAAAAATATCCTTTGTGTACAGTCTCCACTTCTCTCCTGGCTACACATTTGAGAAAT 
ACAGTGGAGGATiTCTACTTCCACCTTCCTATGGAGCTGTTArTCTGTCAGAATTAAGTGGTTCGATGACATTCATGGA 
GACAGTAAGAAATATTATATATGTGTTTTATTTTGACTTTTGGTTCCAAACATTTGATATGAAGAAGGGAGACCAGTTT 
TACAGTGAAGTTCrAGGTAAGTCATGTTTTTTATCTGAGATAATGGGAAAAGCTGAAATGTGGCTCATTCGAAACTACT 
GGTATTTGGAATTTCCTCGCCCACTCTTACCTAATTTTGAATTTGTTGTAAGACTCTACTGCAAACCTGTCAACCCCCT 
GCCTAAGGAGAAAATGGAAGAATTTGCCCAGAGCTCTGATGAAGACGGTGTTGTGTTTTCTCTGGAGTCAGCTGTGCAA 
AACCTTACAGAAGAAAAAGCTGATCTTATCACTTCGGCCCTGGCTCAGATTCCACAAAAAGTCATGAAGTTCGGAAGGA 
AACCAAATACCTTAAGATCCAATACTCAGTGGCATAGGTGGATCCCACAGAATGAATGTCTTATCCTAGATCATCCCCA 
AACCAAAGCCTTTATAACTTATGGTGGAACAAATAGCATCTATGAGATGATCTACCGTGGAGTCCCTTCCATGGGCATT 
CCTTTGTTTGCGGACCAACATGATAACATTGCTCACATGAAGGCCAAGGGAGCAGCTGTTATATTGGACTTGAGCACAA 
AGTCAAGTACAGATTTGCTCGATATATCTGTGTTCGTATCTTTATTTTTATCCTTCAGATATAAAGAGAGTGTTATGAA 
ATTATCAAGAATTCAACATGATCAACCAGTGAAGCCCCTGGATCGAGCAGTCTTCTGGATTGAATTTGTCATGCGCCAC 
AAAGGAGCCAAACACCTTCGAGTTGCAGCCCGTGACCTCACCTGGTTCCAGTACCACTCTTTGGATGTGATTGGGTTTC 
TGCTGGCCTGTGTGGCAACTGTGACATTTATCATCACAAAGTGTTGTCTGTTTTGTTTCTGGAAGTTTACTAGAAAAGT 
GAAGAAGGAAAAAAGGGATTAGTTAT 



The nucleic acid sequence of NOV13 maps to chromosome 4 and has 1305 of 1606 
bases (81%) identical to zHomo sapiens 3,4-catechol estrogen UDP-glucuronosyltransferase 
mRNA (gb:GENBANK-ID:HUMUDPGTA|acc:J05428.1) (E = 6.4e" 217 ). 

A NOV13 polypeptide (SEQ ID NO:28) encoded by SEQ ID NO:27 is 533 amino acid 
residues and is presented using the one letter code in Table 13B. Signal P, Psort and/or 
Hydropathy results predict that NOV13 contains a signal peptide and is likely to be localized 
at the endoplasmic reticulum (membrane) with a certainty of 0.8200. The most likely cleavage 
site for aNOV13 peptide is between amino acids 20 and 21, at: SSS-GS. 



Table 13B. NOV13 protein sequence (SEQ ID NO:28) 

MAMKWTSVbljLIQLSYYSSSGSCGNVPLWPMEY^ 

DDPAGFFFTQQITKWIHDLPKHIPWFKCVPFKNILWEYSGYTEKPFKIJWLNKKLMTNLQESRSDVVHANAIGPFGELLAELL 
KISFVYSLHFSPGYTFEKYSGGFLLPPSYGAVILSELSGSMTFMETVRNIIYVFYFDFWFQTFDMKKGDQFYSEVLGKSCFLS 
EIMGKAEMWLIRNYWYLEFPRPLLPNFEFVWLYCKPWPLPKEK^ 

I PQKVMKFGRKPNTLRSNTQWHRWI PQNECLILDHPQTKAFI TYGGTNS I YEMI YRGVPSMGI PLFADQHDNI AHMKAKGAAV 
I LDLSTKSSTDLLDI SVFVSLFLSFRYKESVMKLSRI QHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAARDLTWFQYHSLDVI 
GFLIACVATVTFIITKCCLFCFWKFTRKVKKEKRD 
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The NOV 13 amino acid sequence has 353 of 533 amino acid residues (66%) identical 
to, and 412 of 533 amino acid residues (77%) similar to, a Homo sapiens 529 amino acid 
residue UDP-glucuronosyl trans ferase 2b7 precursor, microsomal (EC 2.4:1 111) (UDPGT) 
(3 ,4-catechol estrogen specific) (ptnr:SWISSPROT-ACC:P 16662) (E = 7.1e' 185 ). 
5 NOV13 is expressed in at least the following tissues: Liven This information was 

derived by determining the tissue sources of the sequences -that were included in. the invention 
including but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or 
RACE sources. In addition, NOV 13 is predicted to be expressed in liver tissues because of the 
expression pattern of a closely related Homo sapiens 3,4-catechol estrogen UDP- 
10 glucuronosyltransferase mRNA homolog (gb:GENBANK-m:HUMUDPGTA|acc:J05428.1). 

NOV 13 also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 13 C. ' 



Tabte 13C. BLAST results for NOV13 



Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi |4507825 |ref |NP 0 
01065.1 | 
(NM_001074) 


UDP ' 
glycosyltransf era 
se 2 family, 
polypeptide B7 ; 
UDP- 

glucuronyltransf e 
rase, family 2/ 
beta- 7 [Homo 
sapiens] 


529 


334/536 
(62%) 


387/536 
(71%) 


e-179 


gi | 6175083 jspjP0613 
3 |UDB4 HUMAN 


UDP- 

GLUCURONOSYLTRANS 
FERASE 2B4 
PRECURSOR, 
MICROSOMAL 
(UDPGT) 
(HYODEOXYCHOLIC 
ACID) (HLUG25) 
(UDPGTH-1) [Homo 
sapiens] 


528 


333/536 
(62%) 


386/536 
(71%) 


e-176 


qi | 484383 fpir | | JN06 
19 


glucuronosyl trans 
ferase (EC 
2.4.1.17) 2B-4 
precursor [Homo 
sapiens] 


528 


332/536 
(61%) 


385/536 
(70%) 


e-175 


qi| 3153 832 jqb I AAC95 
002. l| (AF064200) 


UDP- 
glucuronosyl trans 
ferase 2B4 
precursor [Homo 
sapiens] 


528 


332/536 
(61%) 


386/536 
(71%) 


e-175 


qi | 4 079707 Jqb | AAC98 
726. ll (AF016310) 


UDP- 
glucuronosyl trans 
ferase [Macaca 
f ascicularis] 


529 


330/536 
(61%) 


385/536 
(71%) 


e-174 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
15 in Table 1 3D. 
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Table 13D ClustalW Analysis of NOV13 

1) NOV13(SEQIDNO:28) 

2) gj|4507825iref|NP 001065.11 (NMJ)01074) UDP glycosyltransferase 2 family, polypeptide B7; UDP- 
glucuronyltransferase, family 2, beta-7 [Homo sapiens] (SEQ ID NO: 144) 

3) gj |6175O83lsplP06133IUDB4 HUMAN UDP-GLUCURONOSYLTRANSFERASE 2B4 PRECURSOR, 
MICROSOMAL (UDPGT) (HYODEOXYCHOLIC ACID) (HLUG25) (UDPGTH-1) [Homo sapiens] (SEQ ID 
NO:I45) 

4) gi|484383lpi rH3NQ619 glucuronosyltransferase (EC 2.4.1.17) 2B-4 precursor [Homo sapiens] (SEQ ID 
NO: 146) 

5) gi[3153832|g b |AAC95002.1l (AF064200) UDP-glucuronosyltransferase, 2B4 precursor [Homo sapiens] (SEQ 
ID NO: 147) 

6) gil 4079707]gb|AAC98726.1i (AF016310) UDP-glucuronosyltransferase [Macaca fascicularis] (SEQ ID 
NO: 148) 



NOV13 



9i 
S* 



4507825 
6175083 
484383 | 
3153832 



gi 4079707 



NO VI 3 

gi | 4507825 
gi | 6175083 
gi j484383 } 
gi|3153832 
gi |4079707 



NOV13 

gi|4507825 
gi j 6175083 
gi J 484383 | 
gi (3153832 
gi |4079707 



NOV13 

gi | 450782S 
gi | 6175083 
gi (484383 | 
gi (3153832 
gi|4079707 



NO VI 3 

gi|4507825 
gi | 6175083 
gi | 484383 | 
gi | 3153832 
gi|4079707 



NOV13 

gi|4507825 
gi j 6175083 
gi (484383 | 
gi|3153832 
gi {4079707 




NOV13 

gij4507825 j 



430 



440 

L---l-^,|.--.| 

ISVFVjSIifSsFR 

5Es 



460 
..I .. 



470 
. . I . . 



480 



490 
| j 



yKEgfflMKLSRlgHDQPVKPLDI^VFWIEFVMRHKGAKHLRVAflgPL 

ykei^mklsriBhdqpvkpldravfwiefvmrhkgakhlrvaahdl 
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gij6175083| 
gi|484383 | 
gi|3l53832| 
gi|4079707j 



NOV13 

gi (4507825 | 
gij 6175083 | 
gi j 484383 | 
gi|3153832 j 
gi|4079707| 



DFHTMSS 
DFHTMSS' 
DFHTMSS 1 

dfStmss 1 




Table 13E lists the domain description from DOMAIN analysis results against 
NOV 13. This indicates that the NOV 13 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 13E. Domain Analysis of NOV13 

gnl 1 P f a m j pf am00201 , UDPGT, UDP-glucoronosyl and UDP-glucosyl 

transferase (SEQ ID NO: 149) 

Length = 501 residues, 100.0% aligned 

Score = 587 bits (1514), Expect = 4e-169 



NOV13 : 24 GNVPLWPME YS PWMNI KT I LDKLMQ I SHEVTVLTLS AS I LVp PNI TS VTKFEV YS I S VI K 83 

G V +WPM+ S WMN+K IL +L+Q HEVTVL SASIL+ P S KFE Y S K 
00201: 1 GKVLVWPMDGSHWMNMKGILiLELVQRGHEVTVLRPSASILIGPAKPSNIjKFETYPDSATK 60 

NOV13 : 84 DDFAGFFFTQQITKWIHDLPKHI FWFKCVPFKNILW EYSGYTEKFFKDWLNK 136 

+ + F K + + WF +W EYS K++V NK 

00201 : 61 EELENLF PKRVMN WFMEAAEAGTVWSYFSALQEYSDGARVSCKELVGNK 109 

NOV13 : 137 KLMTNLQESRSDWHANAIGPFGELLAELLKISFVYSLHFSPGYTFEKYSGGFLLPPSYG 196 

LMT LQES DW A+ + P G LLAELL I VYSL F PGY EK GG PPSY 
00201: 110 FLMTKLQESSFDWLADPVWPCGALLAELLHIPTVYSLRFVPGYAAEKADGGLPAPPSYV 169 

N0V13 : 197 AVILSELSGSMTFMETVRNII YVFYFDFWFQTFDMKKGDQFYSEVLGKSCFLSEIMGKAE 256 

V LS+LS MTF E V+N++ + YFDFWFQ F KK DQF SE+LG+ L E + KA 
00201: 170 PVRLSDLSDGMTFGERVKNMLIMLYFDFWFQRFP-KKWDQFASELLGRPVTLPEDLSKAS 228 

NOV13: 257 MWLIRNYWYLEFPRPLLPNFEFWRLYCKPVNPLPKEKMEEFAQSSDEDGW- FSLESAV 315 

WL+RNYW LEFPRPLLPN EF+ L CKP PLP+E ME F QSS E GW FSL S V 
00201: 229 AWLLRNYWDLEFPRPIiLPNMEFlGGLNCKPAKPLPQE-MEAFVOSSGEHGWVFSLGSMV 28 7 

NOV13 : 316 QNLTEEKADLITSALAQI PQKVM- KF-GRKPNTLRSNTQWHRWIPQNECLILDHPQTKAF 373 

N+ EEKA+ I SALAQIPQKV+ +F G KP+TL +NT+ +W+PQN+ +L HP+T+AF 
00201: 288 SNIPEEKANEIASALAQIPQKVLWRFDGTKPSTLGNNTRLVKWLPQND- - LLGHPKTRAF 345 

NOV13: 3 74 ITYGGTNSI YEMI YRGVPSMGIPLFADQHDNI AHMKAKGAAV ILDLSTKSSTDLLDISVF 433 

+T+ G+N +YE I GVP +G+PLF DQ DN HM+AKGAAV L++ T +S DLL+ 
00201: 346 VTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALK- 404 

NOV13 S 434 VSLFLSFRYKESVMKbSRIOHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAARDLTWFQYH 493 

+ + YKE++M+LS I HDQPVKPLDRAVFWIEFVMRHKGAKHLR AA DLTW+QYH 

00201: 405 -TVINDPSYKENIMRLSSIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRPAAHDLTWYQYH 463 

NOV13 : 494 SLDVIGFLLACVATVTFI ITKCCLFCFWKFTRKVKKEK 531 

S LDV I G FLIiACVATV FI KCCLF + KF K K+ K 
00201: 464 SLDVIGFLLACVA1VAFITFKCCLFGYRKFVGKKKRVK 501 
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The UDP-glucuronosyltransferases, a group of isoenzymes located primarily iii hepatic 

- .. . • . 

endoplasmic reticulum and nuclear envelope, are encoded by a large multigene family that has 
evolved to produce catalysts with differing but overlapping substrate specificities. Two 
subfamilies are recognized by sequence identities (Burchell' et al. 5 1991). UGT1 consists of at 
5 least 4 isoenzymes that catalyze the glucuronidation of phenols and bilirubin. All 4 map to 

chromosome 2 and probably derive from the same gene (UGT1). The UGT2 family contains at 
least 5 members catalyzing sterodd or bile acid glucuronidation. Members of the subfamily 
share 65 to 90% amino acid sequence identity. However, unlike the phenol UGT cDNAs, 
where the high degree of identity is concentrated in the 3-prime region of the cDNA, the 
1 0 steroid UGTs have a high degree of sequence homology throughout the cDNA. Jackson et al. 
(1987) cloned a human liver microsomal UDP-glucuronosyl transferase cDNA. By screening a 
liver library with a rat UGT2B1 cDNA, Jin et a]L (1993) isolated cE>NAs encoding UGT2B4, 
which they called UGT2B1 1, and,UGT2B10. The deduced amino acid sequences shared 
greater than 76% sequence similarity with other known human liver UGT2B subfamily 
1 5 isoforms. The predicted 528-amino acid UGT2B4 protein contained an N-terminal signal 
peptide and a C4erminal transmembrane hydrophobic region. Mammalian cells expressing 
UGT2B4 produced a 51-kD protein that was recognized by a UGT-specific polyclonal 
antibody. The expressed protein catalyzed the glucuronidation of several polyhydroxylated 
estrogens and xenobiotics. Monaghan et al. (1992) stated that the 5 members of subfamily 2 
20 are probably derived from independent genes. Due to the difficulty in isolating suitable 

specific cDNA probes from the UGT2 members, Monaghan et al. (1992) used the polymerase 
chain reaction (PCR) in the chromosomal mapping of the steroid UGT gene encoding a bile 
acid UGT, Analysis of a panel of human/rodent somatic cell hybrids indicated that the gene, 
symbolized UGT2B, is located on human chromosome 4. Burchell et al. (1991) recommended 
25 that the symbol Udpgt-3 used by Krasnewich et al. (1987) be changed to Ugt2b. Monaghan et 
al. (1992) used PCR to map the UGT2B4 gene to chromosome 4. The corresponding gene in 
the mouse was assigned to chromosome 5, which has a region of homology with human 
chromosome 4, by Krasnewich et al. (1987). By fluorescence in situ hybridization with a YAC 
containing all 3 genes, Monaghan et al. (1994) localized UGT2B4, UGT2B9 (also called 
30 UGT2B7), andUGT2B15 to 4ql3. They provisionally ordered the genes as UGT2B9-- 

UGT2B4--UGT2B15. Riedy et al. examined a genomic map spanning approximately 500 to 
1000 kb in the 4ql3 region. They placed UGT2B4 between UGT2B7 and UGT2B15. Also, 
access to a large reference DNA bank allowed them to calculate the allele frequencies for a 
single nucleotide polymorphism (SNP), Q458D, in UGT2B4 among 803 unrelated individuals 
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representing 5 ethnic populations. The findings suggested a recent evolutionary history of gene 
duplication, mutation, and rearrangement. . 

The above defined information for NOV 13 suggests that this NOV13 protein may 
function as a member of a glucuronosyltransferase protein family. Therefore, the NOV 13 
nucleic acids and proteins of the invention are useful in potential therapeutic and diagnostic 
applications. For example, a cDNA encoding the NOV1 3 protein may be useful in gene 
therapy, and the NOV13 protein may be useful when administered to a subject in need thereof 
By way of nonlimiting example, the compositions of the present invention will have efficacy 
for treatment of patients suffering from Crigler-Najjar syndrome and/or Gilbert syndrome, 
"the NOV13 nucleic acid encoding glucuronosyltransferase-like protein, and the 
glucuronosyltransferase-like protein of the invention, or fragments thereof, may further be 
1 useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
protein are to be assessed. 

NOV14 

NOV 14 includes two novel prostasin-like proteins disclosed below and two nucleic 
acid assembly sequences. The disclosed proteins have been named NOV14a and NOV14b. 

NOV14a 



A disclosed NOV 14a nucleic acid of 1726 nucleotides (designated CuraGen Acc. No. 
CG56123-01) encoding a novel prostasin-like protein is shown in Table 14A. An open reading 
frame was identified beginning with an ATG initiation codon at nucleotides 229-231 and 
ending with a TGA codon at nucleotides 1 1 50-1 1 52. A putative untranslated region 
downstream from the termination codon is underlined in Table 14A, and the start and stop 
codons are in bold letters. 



Table 14A. NOV14a Nucleotide Sequence (SEQ ID NO:29) 



AGACGGTGCTGGTGACTCGTCCACACTGCTCGCTTCGGATACTCCAGGCGTCTCCCGTTGCGGCCGCTCCCTGCCTTAG 
AGGCCAGCCTTGGACACTTGCTGCCCCTTTCCAGCCCGGATTCTGGGATCCTTCCCTCTGAGCCAACATCTGGGTCCTG 
CCTTCGACACCACCCCAAGGCTTCCTACCTTGCGTGCCTGGAGTCTGCCCCAGGGGCCCTTGTCCTGGCC ATGGCCCAG 
AAGGGGGTCCTGGGGCCTGGGCAGCTGGGGGCTGTGGCCATTCTGCTCTATCTTGGATTACTCCGGTCGGGGACAGGAG 
CGGAAGGGGCAGAAGCTCCCTGCGGTGTGGCCCCCCAAGCACGCATCACAGGTGGCAGCAGTGCAGTCGCCGGTCAGTG 
GCCCTGGCAGGTCAGCATCACCTATGAAGGCGTCCATGTGTGTGGTGGCTCTCTCGTGTCTGAGCAGTGGGTGCTGTCA 
GCTGCTCACTGCTTCCCCAGCGAGCACCACAAGGGCTCCCAGGGCGACATTGCACTCCTCCAACTCAGCAGACCCACCA 
GCTACTCCCGCTACATCCGGCCCATCTGCCTCCCTGCAGCCAACGCCTCCTTCCCCAACGGCCTCCACTGCACTGTCAC 
TGGCTGGGGTCATGTGGCCCCCTCAGTGAGCCTCCTGACGCCCAAGCCACTGCAGCAACTCGAGGTGCCTCTGATCAGT 
CGTGAGACGTGTAACTGCCTGTACAACATCGACGCCAAGCCTGAGGAGCCGCACTTTGTCCAAGAGGACATGGTGTGTG 
CTGGCTATGTGGAGGGGGGCAAGGACGCCTGCCAGGGTGACTCTGGGGACCCACTCTCCTGCCCTGTGGAGGGTCTCTG 
GTACCTGACGGGCATTGTGAGCTGGGGAGATGCCTGTGGGGCCCGCAACAGGCCTGGTGTGTACACTCTGGCCTCCAGC 
TATGCCTC CTGG ATC C AAAGC AAG GTG A CAGAACTCCAG CCTCGTGTGGTG CCCCAAACCCAGGAGTCCCAGCCCGAC A 
GCAACCTCTGTGGCAGCCACCTGGCCTTCAGCTCTGCCCCAGCCCAGGGCTTGCTGAGGCCCATCCTTTTCCTGCCTCT 
GGGCCTGGCTCTGGGCCTCCTCTCCCCATGGCTCAGCGAGCACTGA GCTGGCCCTACTTCCAGGATGGATGCATCACAC 
TCAAGGACAGGAGCCTGGTCCTTCCCTGATGGCCTTTGGACCCAGGGCCTGACTTGAGCCACTCCTTCCTTCAGGACTC 
TGCGGGAGGCTGGGGCCCC A TCTTGATCTTTGAGCCCATTCTTCTGGGTGTGCTTTTTGGGACCATCACTGAGAGTCAG 
GAGTTTTACTGCCTGTAGCAATGGCCAGAG CC TCTGGCCCCTCACCCACCATGGACCAGCCCATTGGCCGAGCTCCTGG 
GGAGCTCCTGGGACCCTTGGCTATGAAAATGAGCCCTGGCTCCCACCTCTTTCTGGAAGACTGCTCCCGGCCCGCCTGC 
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CCAGACTGATGAGCACATCTCTCTGCCCTCTCCCTGTGTTCTGGGCTGGGGCCACCTTTGTGCAGCTTCGAGGACAGGA 
AAGGCCCCAATCTTGCCCACTGGCCaCTGAGCGCCCCCGAGCCCTGACTCCTGGACTCCGGAGGACTGAGCCCCCACCG 
GAACTGGGCTGGCGCTTGGAtCTGGGGTGGGAGTAACAGGGCAGAAATGATTAAAATGTTTGAGCAC . 

i , r 

The nucleic acid sequence of NOV14a maps to chromosome 16pl 1.2 and has 1229 of 
1238 bases (99%) identical to a Homo sapiens prbstasin mRNA (gb:GENBANK- 
ID:HUMPROS|acc:L41351 .1) (E = 3.3e" 270 ). 
5 A NOV14a polypeptide (SEQ ID NO:30) encoded by SEQ ID NO:29 is 307 amino 

acid residues and is presented using the one letter code in Table 14B. Signal P, Psort and/br 
Hydropathy results predict that NOV 14a contains a signal peptide and is likely to be localized 
at the plasma membrane' with a certainty of 0.8812. The most likely cleavage site for a 
NOV 14a peptide is between amino acids 29 and 30, at: GTG-AE. 

Table 14B. NOV14a protein sequence (SEQ ID NO:30) 

r4AQKGVLGPGQLGAVAILLYLGL.I J RSGTGAEGAEAPCGVAPQARITGGSSA^GQWPWQVSITyEGVHVCGGSL.VSEQWVLSA 
AHCFPSEHHKGSQGDIALLQLSRPTSYSRYfRPICLPAANASFPNGLHCTVTGWGHVAPSVSLLTPKPLQQLEVPLISRETCN 
CLYNIDAKPEEPHFVQEDMVCAGYVEGGKDACQGDSGDPLSCPVEGLWYLTGIVSWGDACGARNRPGVYTLASSYASWIQSKV 
^ I TELQPRWPQTQESQPDSNLCGSHLAFSSAPAQGLLRPILFLPLGIoALGLLSPWIjSEH [ 

The NOV14a amino acid sequence.has 212 of 220 amino acid residues (96%) identical 
to, and 215 of 220 amino acid residues (97%) similar to, a Homo sapiens 343 amino acid . 
residue prostasin precursor (EC 3.4.21.-) (ptnr:SWISSPROT-ACC;Q16651) (E = 6.2e" 166 ). 
Additional to the 4 amino acids mismatch, the NOV 14a sequence lacks 36 internal amino 

1 5 acids, when compared to the Homo sapiens prostasin precursor (EC 3 .4.2 1 .-) 
(ptnr:SWISSPROT~ACC:Q16651). 

NOV14a is expressed in at least the following tissues: Adrenal gland, bone marrow, 
brain - amygdala, brain - cerebellum, brain - hippocampus, brain - substantia nigra, brain - 
thalamus, brain -whole, fetal brain, fetal kidney, fetal liver, fetal lung, heart, kidney, 

20 lymphoma - Raji, mammary gland, pancreas, pituitary gland, placenta, prostate, salivary 

gland, skeletal muscle, small intestine, spinal cord, spleen, stomach, testis, thyroid, trachea, 
uterus, Ascending Colon, Bronchus, Colon, Duodenum, Gall Bladder, Kidney Cortex, Liver, 
Lung, Lung Pleura, Ovary, Parathyroid Gland, Parotid Salivary glands, Peripheral Blood, 
Respiratory Bronchiole, Thymus, and Whole Organism. This information was derived by 

25 determining the tissue sources of the sequences that were included in the invention including 
but not limited to SeqCalling sources, Public EST sources, Literature sources, and/or RACE 
sources. In addition, NOV 14a is predicted to be expressed in seminal fluid because of the 
expression pattern of a closely related Homo sapiens prostasin mRNA homolog 
(gb : GENB ANK-ID :HUMPROS | acc : L4 1351.1), 

30 NOV14b 
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A disclosed NOV14b nucleic acid of 1 161 nucleotides (designated CuraGen Acc. No. 
CG56123-02) encoding a novel prostasin precursor : like protein is shown in Table 140. An 
open reading frame was identified beginning with an ATG initiation codoh at nucleotides 1-3 
and ending with a TGA codon at nucleotides 1 1 59-1 161. The start and stop codons are in bold 
letters in Table 14C. 



Table 14C. NOV14b Nucleotide Sequence (SEQ ID NO:31) 

ATGGCCCAGAAGGGGGTCCTGGGGCCTdGGCAGCTGGGGGCTGTGGCCATTCTGCTCTATCTTGGATTACTCCGGTCGG 

ggacaggagcggaaggggcagaagggaa'tgcccagattggtgtggtggaaacagacacagttgtggactccggaacaat 
tgtgagactgggcagcatcggggagcatggtggaggttggggggtgctgggctcggaggcaatgcccaactcagccagc 

AGAAGCCCG|CCTGTCTACGGAGGGGCTGCTGTTCCAATCAAGCATCCACAGGCAAAGGGAAOTTTGCCAGACACTCCCT 
GCGGTGTGGCCCCCCAAGCACGCATCACAGpTGGCAGCAGTGCAGTCGCCGGTCAGTGGCCCTGGCAGGTCAGCATCAC 
CTATGAAGGCGTCCATGXG't'GTGGTGGCTCTCTCGTGTCTGAGCAGTGGGTGCTGTCAGCTGCTCACTGCTTCCCCAGC 
GAGCACCACAAGGAAGCCTATGAGGTCAAGCTGGGGGCCCACCAGCTAGACTCCTACTCCGAGGACGCCAAGGTCAGCA 
CCCTGAAGGACATCATCCCCCACCCCAGCTACCTCCAGGAGGGCTCCCAGGGCGACATTGCACTCCTCCAACTCAGCAG 
ACCCATCACCrTCTCCCGCTACATCCGGCCCATCTGCCTCCCTGCAGCCAACGCCTCCTTCCCCAACGGCCTCCACTGC 
ACTGTCACTGGCTGGGGTCATGTGGCCCCCTCAGTGAGCCTCCTGACGCCCAAGCCACTGCAGCAACTCGAGGTGCCTC 
TGATCAGTCGTGAGACGTGTAACTGCCTGTACAACATCGACGCCAAGCCTGAGGAGCCGCACIJTTGTCCAAGAGGACAT 
GGTGTGTGCTGGCTATGTGGAGGGGGGCAAGGACGCCTGCCAGGGTGACTCTGGGGGCCCACTCTCCTGCCCTGTGGAG 
GGTCTCTGGTACCTGACGGGCATTGTGAGCTGGGGAGATGCCTGTGGGGCCCGCAACAGGCCTGGTGTGTACACTCTGG 
CCTCCAGCTATGCCTCCTGGATCCAAAGCAAGGACTCTGCGGGAGGCTGGGGCCCCATCTTGATCTTTGAGCCCATTCT 
TCTGGGTGTGCTTTTTGGGACCATCAqTGkGAGTCAGGAGTTTTACTGCCTGTAG 



The nucleic acid sequence of NOyi4b maps to chromosome 16 and has 766 of 776 
bases (98%) identical to a Homo sapiens prostasin mRNA (gbiGENBANK.- 
ID:HUMPROS|acc:L41351.1) (E = 5,5e~ 207 ). 

A NOV14b polypeptide (SEQ ID NO:32) encoded by SEQ ED NO:31 is 386 amino 
acid residues and is presented using the one letter code in Table 14D. Signal P, Psort and/or 
Hydropathy results predict that NOV 14b contains a signal peptide and is likely to be localized 
at the plasma membrane with a certainty of 0.8434. The most likely cleavage site for a 
NOV14b peptide is between amino acids 29 and 30, at: GTG-AE. 



Table 14D. NOV14b protein sequence (SEQ IB NO:32) 

"maqkgvlgpgqlgavaillylgllrsgtgaegaegnaqigwetdtwdsgtivrlgsigehgggwgvlgseampnsasrspp 

VYGGAAVPIKIIPQAKGSXjPDTPCGVAPQARITGGSSAVAGQWPWQVSITYEGVHVCGGSLVSEQWVLSAAHCFPSEHHKEAYE 
VKLGAHQLDSYSEDAKVSTLKDIIPHPSYLQEGSQGDIALLQLSRPITFSRYIRPICLPAANASFPNGLHCTVTGWGHVAPSV 
SLLTPKPLQQbEVPLISRETCNCLYNIDAKPEEPHPVQEDMVCAGYVEGGKDACQGDSGGPLSCPVEGLWYLTGIVSWGDACG 
ARNRPGVYTIASSYASWIQSKDSAGGWGPILIFEPILLGVLFGTITESQEFYCL 

The NOV 14b amino acid sequence has 251 of 257 amino acid residues (97%) identical 
to, and 254 of 257 amino acid residues (98%) similar to, a Homo sapiens 343 amino acid 
residue prostasin precursor (EC 3.4.21.-) (ptnr:SWISSPROT-ACC:Q16651) (E - 1.3e" 154 ). 

NOV14b is expressed in at least the following tissues: prostate, liver, salivary gland, 
kidney, lung, pancreas, colon, bronchus, renal proximal tubular cells and prostate carcinoma 
LNCaP cells. This information was derived by determining the tissue sources of the 
sequences that were included in the invention including but not limited to SeqCalling sources, 
Public EST sources, Literature sources, and/or RACE sources and the expression pattern of a 
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closely related Homo sapiens prostasin mRNA homolpg (gb:GENBANK- 
ID:HUMPROS|acc :L4 1 35 1 . 1 ). 

NOV14 Assemblies 

An insert assembly 16226271 1, Table 14E, was found to encode an open reading frame 
5 between residues 30 and 283 of the target sequence of NOV14a (CG56123-01). The cloned 
insert differs from the original sequence by 36-amino acid insertion between amino acids 93 
and 94 of NOV14a, and a replacement of the amino acids TSY by ITL at positions 108-1 10. 
An insert assembly 162262716, Table 14F, differs from the insert assembly 162262711 by one 
amino acid (ITF instead of ITL at positions 108-1 1 0 of NOV14a). 



Table 14E. 162662711 Nucleotide Sequence (SEQ ID NO:33) 

1 ' AGATCTGCGGAAGGGGCAGAAGCTCCCTGCGGTGTGGCCCCCCAAGCACGCATCACAGGTGGC 
AGCAGTGCAGTCGCCGGTCAGTGGCCCTGGCAGGTCAGCATCACCTATGAAGGCGTCCATGTG 
TGTGGTGGCTCTCTCGTGTCTGAGCAGTGGGTGCTGTCAGCTGCTCACTGCTTCCCCAGCGAG 
CACCACAAGGAAGCCTATGAGGTCAAGCTGGGGGCCC^ACCAGCTAGACTCCTACTCCGAGGAC 
GC C AAGG T C AG C AC C CTG AAG G ACATC AT C C CC CACC C C AG CT AC C TC C AG G AGGG C TC CC AG 
GGCGACATTGCACTCCTCCAACTCAGCAGACCCATCACCCTCTCCCGCTACATCCGGCCCATC 
TGCCTCCCTGCAGCCAACGCCTCCTTCCCCAACGGCCTCCACTGCACTGTCACTGGCTGGGGT 
CATGTGGCCCCCTCAGTGAGCCTCCTGACGCCCAAGCCACTGCAGCAACTCGAGGTGCCTCTG 
ATCAGTCGTGAGACGTGTAACTGCCTGTACAACATCGACGCCAAGCCTGAGGAGCCGCACTTT 
GTCCAAGAGGACATGGTGTGTGCTGGCTATGTGGAGGGGGGCAAGGACGCCTGCCAGGGTGAC 
TCTGGGGGCCCACTCTCCTGCCCTGTGGAGGGTCTCTGGTACCTGACGGGCATTGTGAGCTGG 
GGAGATGCCTGTGGGGCCCGCAACAGGCCTGGTGTGTACACTCTGGCCTCCAGCTATGCCTCC 
TGGATCCAAAGCAAGGTGACAGAACTCCAGCCTCGTGTGGTGCCCCAAACCCAGGAGTCCCAG 
CCCGACAGCAACCTCTGTGGCAGCCACCTGGCCTTCAGCTCTGCCCCAGCCCAGGGCGTCGAC 

10 



Table 14F. 162662716 Nucleotide Sequence (SEQ ID NO:34) 

AGATCTGCGGAAGGGGCAGAAGCTCCCTGCGGTGTGGCCCCCCAAGCACGCATCACAGGTGGC 
AGCAGTGCAGTCGCCGGTCAGTGGCCCTGGCAGGTCAGCATCACCTATGAAGGCGTCCATGTG 
TGTGGTGGCTCTCTCGTGTCTGAGCAGTGGGTGCTGTCAGCTGCTCACTGCTTCCCCAGCGAG 
CACCACAAGGAAGCCTATGAGGTCAAGCTGGGGGCCCACCAGCTAGACTCCTACTCCGAGGAC 
GCCAAGGTCAGCACCCTGAAGGACATCATCCCCCACCCCAGCTACCTCCAGGAGGGCTCCCAG 
GGCGACATTGCACTCCTCCAACTCAGCAGACCCATCACCTTCTCCCGCTACATCCGGCCCATC 
TGCCTCCCTGCAGCCAACGCCTCCTTCCCCAACGGCCTCCACTGCACTGTCACTGGCTGGGGT 
CATGTGGCCCCCTCAGTGAGCCTCCTGACGCCCAAGCCACTGCAGCAACTCGAGGTGCCTCTG 
ATC AG TCGTG AG ACG TGT AAC TG C CTGTAC AAC AT CG ACGC C AAG C C TG AG GAG C C G C A CTTT 
GTCCAAGAGGACATGGTGTGTGCTGGCTATGTGGAGGGGGGCAAGGACGCCTGCCAGGGTGAC 
TCTGGGGGCCCACTCTCCTGCCCTGTGGAGGGTCTCTGGTACCTGACGGGCATTGTGAGCTGG 
GGAGATGCCTGTGGGGCCCGCAACAGGCCTGGTGTGTACACTCTGGCCTCCAGCTATGCCTCC 
TGGATCCAAAGCAAGGTGACAGAACTCCAGCCTCGTGTGGTGCCCCAAACCCAGGAGTCCCAG 
CCCGACAGCAACCTCTGTGGCAGCCACCTGGCCTTCAGCTCTGCCCCAGCCCAGGGCGTCGAC 



Possible small nucleotide polymorphisms (SNPs) foimd for NOV14a are listed in 
Tables 14G and 14H. 



Table 14G: SNPs 
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Consensus 


Depth 


Base 


PAF 


Position 




Change 




188 


, 9 


G> T 


0.222 


190 


9 


G>C 


0.222 


196 


9 


OG • 


• 0.222 


197 


9 


A> G 


0.222 


203 


9 


->C 


0.222 


212 


9. 


. ->c 


0.222 


212 


9 


->T 


0.222 


291 


1 To - 


->C 


0.300 


292 


10 


->c 


0.300 


332> 


10 


T>C 


0.200 


555 


8 


G > A 


0.250 



Table 14H: SNPs 


Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Change 


13376700 


373 


A > G 


49 


Ser>Gly 


13376699 


447 


T>C 


Silent 


N/A 


13374234 


617 


T>C 


130 


Leu > Pro 


13376698 


690 


A > G 


Silent 


N/A 


13374602 


698 


T>C 


157 


Val > Ala 


13374601 


737 


A>C 


170 


Asn > Tin- 


13374235 


839 


A > G 


204 


Asp > Gly 


13374600 


1090 


A > G 


288 


He > Val 



NOV14a and NOV14b are very closely homologous as is shown in the amino acid 
alignment in Table 14L 

Table 141 Amino Acid Alignment of NOV14a and NOV14b 



N0Vl4a 
NOVl4b 



20 

J i . 



30 



MAQKGVriGPGQLGAVArLLYIiGLLRSGTGAEGAE 
MRQKGVIxGPGQLGAVAlLIiYLGIjtjRSGTGREQAEi 



40 50 
.|....|....|....J. 



60 



•I- 



70 



90 



NOV14a 

NOV14b SEAMPNSASRSPPVYGGAAVPIKHPQAKGSLPDT 



INAQ I G WETDT WD SGT I VRLGS I G EH GG GWG VLG 
100 110 120 130 140 



PCGVAPQARITGGSSAVAGQWPWQVSITYEGVHVCG 
PCGVAPQARITGGSSAVAGQWPWQVSITYEGVHVCG 



NOVl4a 
NOVl4b 



NOV14a 
NOV14b 



150 



170 



180 



190 



200 210 

..|.... )■.•■! 



GSQGDIALLQLS 
GSQGDIALLQLS 



220 



NOVl4a SjTS^ 
NOV14b SJlTF 



IEAYEVKLGAHQLDSYSEDAKVSTLKDI I PHPSYLQEg 

260 270 280 



230 240 250 
1 1 1 1 j . 



SRYIRPICLPAft ^FPgGLHCTVTGWGHVAPSVSLLTPKPLQQLEVPLI SRET 
SRYIRPICLPAA^SFP^GIiHCTVTGWGHVAPSVSLLTPKPLQQLEVPLISRST 



300 



eephfvqedmvcagyveggkdacqgpsggplscpveglwyltgivswgdacg 
eephfvqedwvcagyveggkdacqgpsgSplscpveglwyltgivswgdacg, 



390 



320 330 340 350 

. I — I — ( — I — L ... |....[ 



rpgvytlassyaswi 
rpgvytlassyaswi 
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NOV14a 
NOVl4b 



. 1 .... I .... I .... I | . . | . . | . ., . | | . . . . | . . ■ 

.-DSAG6- - WgJplSigEP- - - - -ffBLtSyBBGTjlTEgQEPYCIi- 



Homologies to any of the above NOY,14 proteins will be shared by the other NOV14 
proteins insofar as they are homologous to each other as shown above. Any reference to 
NOV 14 is assumed to refer to both of the NOV 14 proteins in general, unless otherwise noted. 

NOV 14a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 14J. 1 



Table 14J. BLAST results for NOV14a 



Gene Index/, 
Identifier 



Protein/ Organism 



Length 
(aa) 



Identity 
(%) 



Positives 
(%) 



Expect 



gi | 4506153 [ref |NP 0 
02764 .1 | 
[NM 002773) 



protease, serine, 
8 (prostasin) 
[Homo sapiens] 



343 



278/318 
(87%) 



280/318 
(87%) 



"217/318 
(68%) 



e-148 



gi j 11181573 | gb \ AAG3 
2641.1|AF202076 1 
(AF202076.) 



prostasin [Rattus 
norvegicus] 



342 



240/318 
(75%) 



Prostasin 
precursor [Rattus 
norvegicus] 



gi)!3632973 |sp[Q9ES 
87|PSS8 RAT 



342 



217/318. 
(68'%) 



239/318 
(74%) 



e-112 



gi } 13277969 |gb|AAH0 
3851.ljAAH03851 
(BC003851) 



gi|!3633203|sp|Q9ES 
D1|PSS8 MOUSE 



Similar to 
protease, serine, 
8 (prostasin) 
[Mus musculus) 



339 



218/317 
(68%) 



240/317 
(74%) 



e-112 



Prostasin 
precursor 
(Channel 
activating 
protease 1) [Mus 
musculus] 



342 



217/317 
(68%) 



239/317 
(74%) 



e-111 
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The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 14K. 



Table 14K ClustalW Analysis of NOVK 

1) NOV14a(SEQIDNO:30) 

2) gi[4506153lref1NP 002764. 1| (NM_002773) protease, serine, 8 (prostasin) [Homo sapiens] (SEQ JD 
NO; 150) 

3) gi]l 11 8 1 573|g b [AAG3264 1 . 1 1AF202076 1 (AF202076) prostasin [Rattus norvegicus] (SEQ ID NO: 151) 

4) Eill3632973lsp|Q9BS871PSS8 RAT Prostasin precursor [Rattus norvegicus] (SEQ ID NO: 152) 

5) gil l3277969lgblAAH03851.11AAH03851 (BC003851) Similar to protease, serine, 8 (prostasin) [Mus 
musculus] (SEQ ID NO: 1 53) 

6) gj[13633203 )sp lO9BSDHPSS8 MOUSE Prostasin precursor (Channel activating protease 1) [Mus musculus] 
(SEQ ID NO: 154) 



NOV14a 
gij4506153| 
gij 11181573 [ 
gijl3632973 j 
gij 13277969 | 
gi|13633203 | 



N0V14a 




90 



100 



110 



130 



. 102 
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gi|4506153 | 
gijlll81573 | 
gi|l3632973 j 
gijl3277969 j 
gi j 13633203 j 



NOV14a 
gi |4506153 j 
gi j 11181573 | 
gi j 13632973 | 
gi | 13277969 j 
gij 13633203 j 



NOV14a 
gi | 4506153 | 
gi jlll81573 j 
gij 13632973) 
gij 13277969[ 
gi {13633203 [ 



N0V14a 
gi}4506153 | 
gi 1 11181573 j 
gi|l3632973 j 
gi jl3277969) 
gi 1 13633203 j 




IKEfflvgELQPRWPQTQESQPD 
I ^S V USLQPRWPQTQESQP 
IHHHVAELQPRVVPQTQESQPDGHLdjjHHP 

ihhhvaelqpr2!vpqtqksqpdghloShhpv 
ihhhvaelqprwpqtqesqpdghlc^hhpvfs 
ihhhvaelqprwpqtqesqpdghlcShhpvpss 



290 



300 





Tables 14L and 14M list the domain description from DOMAIN analysis results 
against NOV14a. This indicates that the NOV14a sequence has properties similar to those of 
other proteins known to contain these domains., 



Table 14L. Domain Analysis of NOV14a 

gnl \ Smart \ smartQ002Q , Tryp_SPc, Trypsin- like serine protease; Many of 
these are synthesised as inactive precursor zymogens that are cleaved 
during limited proteolysis to generate their active forms. A few, 
however, are active as single chain molecules, and' others are inactive 
due to substitutions of the catalytic triad residues. (SEQ ID NO: 155) 
Length = 230 residues, 100.0% aligned 
Score = 205 bits (52 2) , Expect « 3e-54 



NOV14a: 


44 


00020; 


1 


N0V14a : 


91 


00020: 


61 


NOV14a: 


129 


00020 : 


121 


NOV14a: 


189 


00020: 


174 



RI TGGSS AVAGQWPWQVS I TYE - GVHVCGGSLVSEQWVLSAAHCFPSE 90 

II III I I +11111+ I I I HUM +IM + IIII 

RIVGGSEANIGSFPWQVSLOYRGGRHFCGGSIjISPRWVLTAAHCVYGSAPSSIRVRLGSH 6 0 



HHKGS QGDIALLQLSRPTSYSRYIRPICLPAANASFPN 128 

.1 + lllll+ll I + I +IIIIII++ + I 

DLSSGEETQTVKVSKVIVHPNYNPSTYDNDIALLKLSEPVTLSDTVRPICLPSSGYNVPA 120 



II++ II++I 



+ ++I+II 
-SGGPAITDNMLCA 173 



III 



II 
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Table 14M. Domain Analysis of NOV14a 

gnl |Pfamjpfam00089 , trypsin, Trypsin. Proteins recognized include all 
proteins in families SI, S2A, S2B, S2C, and S5 in the classification 
of peptidases. Also included are proteins' that are' clearly members, 
but that lack peptidase activity, such as haptoglobin and protein Z 
(PRTZ* ) . (SEQ ID NO:156) 
Length » 217 residues, 100.0% aligned 

Score - 177 bits (450) , Expect - 6e-46 

NOV14a: 45 
00089: 1 
NOV14a: 92 
,00089: 61 
NOV14a: 134 
00089: 121 
NOV14a: 194 
00089: 169 

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being 
found in viruses, bacteria and eukaryotes (Rawlings N.D., Barrett A.J. Families of Serine 
5 Peptidases. Meth. Enzymol. 244: 19-61,1 994). They include a wide range of peptidase 

activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. 
Over 20 families (denoted SI - S27) of serine protease have been identified, these being 
grouped into 6 clans on the basis of structural similarity and other functional evidence 
(Rawlings et al. 1994). 

1 0 Tryptase is a tetrameric serine protease that is concentrated and stored selectively in 

the secretory granules of all types of mast cells, from which it is secreted during mast cell 
degranulation. Its exclusive presence in mast cells permits its use as a specific clinical 
indicator of mast cell activation by measurement of its level in biologic fluids and as a 
selective marker of intact mast cells using imiminoliistochemical techniques with antitryptase 

15 antibodies. Vanderslice (Vanderslice et al., Human mast cell tryptase: multiple cDNAs and 
genes reveal a multigene serine protease family. Proc. Nat. Acad. Sci. 87: 381 1-3815, 1990) 
demonstrated the existence of multiple tryptases. In this respect, mast cell tryptase is like other 
serine proteases such as glandular kallikrein and trypsin, which are also members of multigene 
families. 

20 Miller et al. (Miller et ah, Cloning and characterization of a second complementary 

DNA for human tryptase. J. Clin. Invest. 86: 864-870, 1990) mapped both alpha-tryptase and 
beta-tryptase to human chromosome 16 by PCR analysis of DNA from human/hamster 
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somatic cell hybrids. Miller et al. (1990) cloned a second cDNA for human tryptase, called 
beta-tryptase, from a mast cell cDNA library. The 1,142 bases of beta-tryptase were found to 
encode a 30-amino acid leader sequence of 3 5 Q89 daltons and a 245-amino acid catalytic 
region of 27,458 daltons. The amino acid sequence of beta-tryptase was found to be 90% 
5 identical with that of alpha-tryptase, the first 20 amino acids of the catalytic portions being 
100% identical. Both alpha- and beta-tryptase sequences were localized to human 
chromosome 16 by analysis of DNA preparations from' 25 human/hamster somatic cell hybrids 
byPCR. 

Yu et al. have purified a novel human serine proteinase, designated as prostasin, from 
10 seminal fluid (Yu et al., Molecular cloning, tissue- specific expression, and cellular localization 

of human prostasin mRNA. J Biol Chem 270(22): 13483-9, 1995). A full-length cDNA 

i 

sequence encoding prostasin was obtained by amplification of the 5'- and 3 ? -ends of the 
cDNA. It contains a 1,032-base coding region, a 572-base 3'-noncoding region and a 138-base 
5'-noncoding sequence. Prostasin cDNA encodes a protein of 343 amino acids, which consists 

15 of a 32-amino acid signal peptide and a 3 11 -amino acid proprostasin. Proprostasin is then 
cleaved between Argl2 and He 13 to generate a 12-amino acid light chain and a 299-amino 
acid heavy chain, which are associated through a disulfide bond. The deduced amino acid 
sequence of the heavy chain has 34-42% identity to human acrosin, plasma kallikrein, and 
hepsin. A potential N-glycosylation site at Asnl27 and the catalytic triad of His53, Aspl02, 

20 and Ser206 have been identified. The deduced prostasin has a unique 19-amino acid 

hydrophobic portion at the COOH terminus, which makes it suitable to anchor in the cell 
membrane. Carboxyl -terminal sequencing of purified prostasin indicates that the hydrophobic 
portion is removed and that there is a cleavage between Arg290 and Pro291 during secretion. 
Southern blot analysis, following a reverse transcription polymerase chain reaction, indicates 

25 that prostasin mRNA is expressed in prostate, liver, salivary gland, kidney, lung, pancreas, 
colon, bronchus, renal proximal tubular cells, and prostate carcinoma LNCaP cells. Cellular 
localization of prostasin mRNA was identified within epithelial cells of the human prostate 
gland by in situ hybridization histochemistry. 

Because of the presence of the trypsin domains and the homology to the tryptase, we 

30 anticipate that the novel sequence described here will have useful properties and functions 
similar to these genes. 

Human seminal fluid is a rich source of proteolytic enzymes, many of which are 
involved in the postejaculatory hydrolysis of proteins and in semen coagulation and 
liquefaction. Prostate-specific antigen and acrosin are two of the most important proteolytic 
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enzymes found in human semen. Prostate-specific antigen may play an important role in 
semen liquefaction through hydrolyzing semenogelin, a predominant seminal vesicle protein. . 
Prostate-specific antigen levels in blood have been recognized recently as the most important 
marker for prostate cancer. Acrosin is a serine proteinasepresent in acrosomes, where it 
5 covers the anterior part of the sperm head. It is believed to be involved in recognition, binding, 
and penetration of the zona pellucida of the ovum during interaction of the sperm and egg. The 
serine proteinase prostasin, has been identified and t purified from human seminal fluid. At the 
present time, the physiological functions of prostasin are unknown, and its physiological 
substrate remains to be identified. Prostasin has an apparent molecular mass of 40 kDa on 

10 SDS-polyacrylamide gel electrophoresis and displays arginine amidolytic activity. The N- 
terminal 20-amino acid sequence of prostasin shares 50-55% identity with human alpha- 
' tryptase, elastase 2A and 2B, chymotrypsin, acrosin, and the catalytic chains of hepsin, plasma 
kallikrein, and coagulation factor XL It is present in many tissues and has the highest level in 
the prostate gland. In the prostate gland, prostasin has been localized in epithelial cells and 

15 ducts by immunohistochemistry. It is believed that prostasin ( is synthesized in prostatic 

epithelial cells, secreted into the ducts, and excreted into the seminal fluid, where it may serve 
a role in fertilization. The wide distribution of prostasin outside the prostate gland indicates 
that it may also play important roles in other biological processes. The full-length PRSS8 gene 
has been isolated and characterized. A 7-kb PRSS8 gene fragment has been sequenced, 

20 including a 1.4-kb 5-flanking region, the 4.4-kb PRSS8 gene, and a 1.2-kb 3'-flanking region. 
The gene consists of six exons and five introns based on comparison with its cDNA sequence. 
The sizes of these exons are 417, 18, 163, 272, 167, and 899 bp, while those of the introns are 
243, 1763, 271, 85, and 92 bp. A number of potential regulatory elements have been revealed 
in the 5'-flanking region, including an AP2 site, two erythroid-specific promoter elements, and 

25 a sterol regulatory element. In addition, there are a variant GC box and a variant API site in 

the promoter region. The transcription initiation site of the PRSS8 gene has been defined at the 
G residue and its adjacent A residue in a sequence CTCATGACT, which is similar to an 
initiator element CTCANTCT. Between the transcription initiation site and these putative 
regulatory elements, there is an AC-rich repetitive sequence that spans over 300 bp. Human 

30 PRSS8 is a single-copy gene and has been localized on chromosome 16pl 1 .2 by in situ 
hybridization. 

The full-length cDNA sequence encoding prostasin contains a 1,032-base coding 
region, a 572-base 3'-noncoding region and a 138-base 5-noncoding sequence. Prostasin 
cDNA encodes a protein of 343 amino acids, which consists of a 32-amino acid signal peptide 
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and a 31 1 -amino acid proprostasin. Proprostasin is then cleaved between Argl2 and Ilel3 to 

generate a 12-amino acid light chain and a 299 -amino acid heavy chain, which arc associated * 

through a disulfide bond. The deduced amino acid sequence of the heavy chain has 34-42% 

identity to human acrosin, plasma kallikrein, and hepsin. A potential N-glycosylation site at 

Asnl27 and the catalytic triad of His53, Aspl02, and Ser206 have been identified. The 

deduced prostasin has a unique 19-amino acid hydrophobic portion at the COOH terminus, 

which makes it suitable to anchor in the cell membrane. Carboxyl-terminal sequencing of 

purified prostasin indicates that the hydrophobic portion is removed and that there is a 

cleavage between Arg290 and Pro291 during secretion. Southern blot analysis, following a 
i i 
reverse transcription polymerase chain reaction, has indicated that prostasin mRNA is 

expressed in prostate, liver, salivary gland, kidney, lung, pancreas, colon, bronchus, renal 

proximal tubular cells, and prostate carcinoma LNCaP cells. Cellular localization of prostasin 

mRNA has been identified within epithelial cells of the human prostate gland by in situ 

hybridization histochemistry. 

The broad existence of prostasin mRNA in human tissues suggests that it may have 
important biological functions. Localization of prostasin mRNA in the epithelial cells of the 
prostate gland indicates that prostasin is synthesized in the cells and then secreted into the 
ducts. The presence of prostasin in prostatic epithelial cells and ducts has been identified by 
immunohistochemistry. Since it is likely to be a membrane-bound serine proteinase, prostasin 
may be involved in some important processes on the surface of cell membranes, such as 
removal of propeptides from hormones and growth factors and the activation of proenzymes 
associated with membranes. 

The above defined information for NOV14 suggests that this NOV14 protein may 
function as a member of a prostasin protein family. Therefore, the NOV 14 nucleic acids and 
proteins of the invention are useful in potential therapeutic and diagnostic applications. For 
example, a cDNA encoding the NOV 14 protein may be useful in gene therapy, and the 
NOV 14 protein may be useful when administered to a subject in need thereof. By way of 
nonlimiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from fertilization disorders, conditions resulting from defective 
removal of propeptides from hormones and growth factors and/or activation of proenzymes 
associated with membranes. The NOV 14 nucleic acid encoding prostasin-like protein, and the 
prostasin-like protein of the invention, or fragments thereof, may further be useful in 
diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 
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NOV15 

A disclosed NOVl 5 nucleic acid of 29^0 nucleotides (designated CuraGen Acg. No. 
CG50153-01/AC025263_jia2) encoding a novel Low Density Lipoprotein Receptor (LDLR> 
like protein is' shown in Table 1 5A. An open reading frame was identified beginning with an 
ATG initiation codon at nucleotides 1-3 and ending with a TAA codon at nucleotides 293 8- 
2940. The start and stop codons are in bold letters in Table 15 A. 



Table 15A. NOV15 Nucleotide Sequence (SEQ ID NO:35) 



atggccaccgcggcAacctcacccgcgctgaagcggctggatctgcgcgaccctgcggctcttttcgagacgcatggag 

CGGAGGAGAjTCCG CGGGCTGGAG CG CCAGGTTCGGG CCG AG AT CG AG C ACAAGAAGG AGG AGCTGCGG CAG ATGGTGGG 

cghacggtaccgcgacctgatcgaggcggcpgacaccatcggccagatgcgccgctgcgccgtggggctagtggacgcc 
gtgaaggccaccgaccagtactgcgcccgcctccgccaggccggctcggccgcgccccggccaccgcgggcccagcagc 
cacagcagccatcccaagagaagttctacagcatggctgcccagatcaagctactcttagaaattccggagaagatctg 
gagctcgatggaagcctctcagtgtctccacgccacacagctctacctgctctgctgccacctccacagcctgctccag 
ctggattcttctagttcccgatacagtcccgtcctctccccggttcctatactcatccggcaggtggcagccgccagcc 
acttccggtcaactattctgcatgaaagcaagatgttgctcaaatgccaaggtgtgtctgaccaagctgtggccgaggc 
cctgtgctctataatgctcttagaagagagttctcctcgccaagccctcacagacttcctgctggccagaaaggcaact 
attc^gaaacttctcaaccagccacaccatggtgctggtatcaaggctcagatttgctcattagtggagttgctggcca 
ccactctgaagcaagctcatgcccttttctacactttgccagaaggactgctgccagatccagccctgccatgtggctt 
gctcttctctactctggagaccatcacaggccagcatcctgccaagggcactggtgtcctgcaggaagagatgaaactc 
tgcagctggtttaaacacctgccagcatccatcgtcgagttccagccaacactccgaacccttgcacatcccatcagtc 
aggaatacctgaaagacacgctgcagaaatggatccacatgtgtaatgaagacattaaaa^tgggatcaccaacctgct 

CATGTACGTGAAGAGCATGAAGGGTCTCGCGGGAATCCGGGACGCCATGTGGGAGTTACTTACCAATGAGTCCACCAAT 
CACAGCTGGGATGTGCTATGTCGGCGGCTTCTGGAGAAGCCGCTCTTGTTCTGGGAAGATATGATGCAGCAACTGTTCC 
TTGACCGATTACAGACTCTGACAAMGAAGGCTTTGACTCCATCTCCAGTAGCTCCAAGGAGCTCTTGGTTTCAGCTTT 
GCAGGAACTTGAAAGCAGCACCAGCAACTCCCCTTCAAATAAGCACATCCACTTTGAGTACAACATGTCGCTCTTCCTC 
TGGTCTGAGAGTCCTAATGACCTGCCTTCCGATGCGGCCTGGGTCAGCGTGGCAAACCGGGGTCAGTTAGGGGTCGCTG 
GCCTCTCTATGAAAGCACAAGCCATCAGCCCTTGTGTACAGAACTTCTGTTCTGCCCTGGATTCTAAGCTGAAGGTTAA 
ACTAGATGACCTCCTGGCTTACCTCCCCTCTGATGACTCATCACTGCCCAAGGACGTTTCTCCCACACAGGCCAAGAGT 

tctgcctttgacagatacgcagatgcggggaccgtgcaggagatgctgcggactcagtccgtggcatgcatcaagcaca 
tcgtggactgcatccgggcagagctacagagcattgaagaaggtgtgcaagggcaacaggatgccctcaacagtgccaa 
gctgcactcagttcttttcatggccagactctgccagtccctgggagagctgtgcccccatctgaagcagtgcatcctg 

GGAAAATCAGAGAGCTCAGAGAAACCAGCAAGGGAGTTTAGGGCTCTGAGAAAACAGGGAAAGGTGAAAACTCAGGAAA 

tcattcctacacaggccaagtggcaagaggttaaagaagtactcctccagcagagcgtgatgggctaccaggtctggag 
cagtgcagttgtgaaagttttgattcatggattcacccagtcattacttctagatgatgctggctcagttctggccaca 
gccaccagctgggatgagctagaaattcaggaggaggcagagtctggcagcagtgtcacatccaagatccgactccctg 
cacagccgtcctggtatgtacagtccttcctgtttagtttatgccaggaaattaatcgggttggaggccatgccttgcc 
aaaggtgacattacaggagatgctgaaaagctgtatggttcaagtagtagctgcctatgaaaaactctccgaagaaaaa 
cagattaagaaagaaggtgcatttccagtcacccagaaccgggcgctgcagctgctttatgatctgcgttacctcaaca 

TTGTTCTGACAGCCAAGGGTGACGAGGTGAAGAGTAGCCGGAGCAAGCCAGACTCCAGAATTGAGAAAGTGACTGACCA 
CCTGGAAGCCCTCATTGATCCATTTGACCTGGACGTTTTCACGCCACACCTCAACAGCAACCTTCATCGCCTGGTGCAG 
CGAACTTCTGTTCTGTTTGGATTGGTGACTGGTACAGAGAATCAGCTCGCCCCCCGGAGCAGTACGTTCAACTCCCAAG 
AACCCCATAACATCCTGCCGCTGGCATCCAGTCAGATCAGGTTTGGACTTCTCCCACTGAGCATGACAAGCACTCGAAA 
GGCTAAATCAACCAGAAACATCGAAACAAAAGCTCAGGTTGTCCCCCCGGCACGCTCCACAGCTGGTGACCCGACAGTT 
CCTGGCTCCTTGTT C AG ACAG CTTGTC AGTG AAG AAG AC AACACGTCTG C A C CTTCATTATTCAAACTTGG CTGGCTCT 
CTAGTATGACTAAGTAA 



The nucleic acid sequence of NOV15 maps to chromosome 17 and has 2804 of 2836 
bases (98%) identical to a Homo sapiens KIAA1381 mRNA (gb:GENBANX- 
10 ID:AB037802|acc:AB037802.1) (E = 0.0). 

A NOVl 5 polypeptide (SEQ ID NO:36) encoded by SEQ LO NO:35 is 979 amino acid 
residues and is presented using the one letter code in Table 15B. Signal P, Psort and/or 
Hydropathy results predict that NOVl 5 does not contain a signal peptide and is likely to be 
localized at the mitochondrial matrix space and the microbody (peroxisome) with a certainty 
15 of 0.4824. 
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Table 15B. NOV15 protein sequence (SEQ ID ]VO:36) 

u | < * * 

t^TAATSPALKRIjDLRDPAAIjFETHGAEEIRGLERQVRAEIEHKKEELRQm^ 
DQYCARX»RQAGSAAPRPPRAQQPQQPSQEKFYSMAAQIKLLLEI^^ 

SPVLS PVP I LI RQVAAASHFRSTI LHESKMLLKCQGVSDQAVAEALCS IMLLEES SPRQALTDFIiLTVRKA'TIQKLLNQPHHGA 
GIKAQICSLyELLATTLKOAHALFYTLPEGLLPDPALPCGLLFSTLETITGQHPAKGTGVLQEEMKLCSWFKHIiPAStVEFQP 
ThRThAHPlSQBYhKDThQKWXHMCNEDI KNG I TNLLMYVKSMKGLAGIRDAMWELLTNESTNHSWDVLCRRLLEKPLLFWED 
MMQQLFIJ3RLQTIjTKEGFDSISSSSKELLVSAI^ 

VAGLSMKAQAISPCVQNFCSALDSKLKVKLDDL.LAYLPSDDSSLPKDVSPTQAKSSAFDRYADAGTVQEMLRTQSVACIKHIV 
DCIRAELQSIEEGVQGQQDALNSAKLHSVLFMARLCQSLGELCPHLKQCILGKSESSEKPAREFRALRKQGKVKTQEIIPTQA 
KWQEVKEVLLQQSVMGYQVWSSAWKVLIHGFTQSLLLDDAGSVLATATSWDELEIQEEAESGSSVTSKIRLPAQPSWYVQSF 
LFSLCQEINRVGGHALPKVTLQEMLKSCMVQVVAAYEKbSEEKQIKKEGAFPVTQkRALQLLYDLRYLNIVLTAKGDEVKSSR 
SKPDSRIEKVTDHLEALlDPFDLDVFTPHLNSNLHRLVQRTSVLFGLVTGTENQIAPRSSTFNSQEPHNILPbASSQIRFGIiL 
PLSMTfiTRKAKSTRNIETKAQWPPARSyAGPPTVPGSLPRQLVSEEDNT^APSLFKLGWLSSMTK ■ ' ' , 

The NOV15 amino acid sequence has 804 of 980 amino acid residues (82%) identical 
to, and 878 ol*980 amino acid residues (89%) similar to, aMus musculus 980 amino acid 
residue LDLBP (ptnr:SPTREMBL-ACC:Q9Zl 60) (E = 0.0). 
5 NOV15 is expressed in at least the following tissues: ovaries, liver, epidermis, 

fibroblast and blood leukocytes. This information was derived by determining the tissue 
sources of the sequences that were included in the invention including but not limited to 
SeqCalling sources, Public EST sources, Literature sources, and/or RACE sources. In 
addition, NOV15 is predicted to be expressed in brain tissues because of the expression pattern 
10 of a closely related Homo sapiens KIAA1 381 mRNA homolog (gb:GENB ANK- 
ID:AB037802(acc:AB037802.1). 

Possible small nucleotide polymorphisms (SNPs) found for NOV15 are listed in Table 

15C. 



Table 15C: SNPs 


Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Change 


13374274 


2485 


A > G 


829 


Ser > Gly 



15 NOV 15 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 15D. 



Table 15D. BLAST results for NOV15 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
<aa) 


Identity 
(%> 


Positives 
(%) 


Expect 


gi| 15011849 |ref |NP 


low density 
lipoprotein B 
[Mus musculus] 


980 


767/983 
(78%) 


833/983 
(84%) 


0.0 


038609.2 | 
(NM 013581) 


gi | 7243143 |dbi | BAA9 


KIAA1381 protein 
[Homo sapiens] 


961 


858/935 
(91%) 


859/935 
(91%) 


0.0 


2619. 1| (AB037802) 


gi|l7459l6l|ref |XP 


low density 
lipoprotein 
receptor defect B 
complementing 
[Homo sapiens] 


962 


860/936 
(91%) 


861/936 
(91%) 


0.0 


040307 .2 | 
(XM_04 0307) 
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gi|H36029l|pir| | T5 


hypothetical 
protein 
DKFZp762L1710.1 
(fragment) [Homo 
sapiens] 


438 


437/438 
(99%) 


437/438 
(99%) 


0.0 


0629 


gi 1 3.5237322 |ref|NP 
197134.11 
(NC_003076) 


low density 
lipoprotein B- 
like protein 
[Arabidopsis 
thaliana] 


1068 ' 


83/313 
,(26%) 


134/313 
(42%) 


3e-17 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 15E. 



TabIe l5E ClustalW Analysis of NOV15 

'l),NOV15(SEQIDNO:36) , ' 

2) gj jlSOj 1849|reflNP 038609.21 (NM_013581) low density lipoprotein B [Mus musculus] (SEQ ID NO:157) 

3) gi| 72431431dbilBAA92619.11 (AB037802) K1AA13S1 protein [Homo sapiens] (SEQ ID NO:158) 

4) gi[17459161[rcftXP 040307.21 (XM_O403O7) low density lipoprotein receptor defect B complementing 
[Homo sapiens] (SEQ ID NO: 159) 

5) gi ] 11 36029 llpirliT50629 hypothetical protein DKFZp762L1710.1 (fragment) [Homo sapiens] (SEQ ID 
NO:160) 

6) gi|15237322|reflNP 197134.1 [ (NC_003076) low density lipoprotein B-like protein [Arabidopsis thaliana] 
(SEQ ID NO:161) 



N0V15 

gi|l5011849| 
gi | 7243143 | 
gi|l745916l| 
gi [11360291 
gi|l5237322| 



NOV15 

gi|l5011849| 
gi | 7243143 | 
gi[,17459l6l| 
gi 11360291 | 
gi jl5237322 j 



NOV15 

gi|l5011849| 
gi|7243143| 
gi|l7459161 j 
gi 1 11360291 1 
gi [15237322 j 



•I- 



10 

--MgAgTA 



I 



40 



50 
.].. 



60 
• ]_• • 



•]_• 



KRLDLRDPAT^FETHGAEEIRGLERQVi^EIEHKKEELRQMVGERyRDLI 
iLKRX^LRD^ALFETHGAEEIRGLERQVRAEIEHKKEELRQIWGERYRDLI 
IiKRLDIJU3PAM;FETHGAEEIRGLERQVRAEIEflKI^ELRQMVGERyRDLI 
LKRLDLRD PAALFETHGAEE IRGLERQVRAE I EHKKEELRQMVGERYRDL I 



MRMSSASAGEYRP;sg!vgLSSNGGGQ|2S A ES[3BR{jKPMS|35Slgl|SATl2! 
80 90 100 110 



• i • 



•I 



100 



120 
.1.. 



130 



EAADTIGQMRRCAffiGLVDAV gATDQYC^PJ^RQAGSAAPRgPRA toPQEPSgEKFYSMAAQIKLLLEIPE 
EAADTIGQMRRCAgGLVDAV BATDQYCARIiRQAGSS^PRBPRA jQPQgPS HEKFYSMAAQIKLLLEIPEP 
EAADTIGQMRROsSgLVDAV gATDQYCARIiRQAGSAAPRHPRA SqPqBpsBeKFYSMAAQIKIiLLEIPEP 

saadtigqmrrcaRglvdav "atdqycarlrqagsaapr@pra sqpqSpsBekfysmaaqikxjLLEIPe 



140 

L 




190 

.1.. 



200 



RYSPffiLSgEjPILIRQVAAASHFRSTILHES 
RYSPRlSRFPILIRQVAAASHFRSTTLHES 
RYSPffiLSRFPILIRQVAAASHFRSTILHES 

rys pRlsrfpilirqvaaashfrstilhes 



3^c|£e@mfBe^gr0^c^^ 



220 



230 



240 



270 



NOV15 

gi [15011849 | 
gi[7243143 | 
gif 17459161 | 
gi|H360291 j 
gi[l5237322 | 



NOV15 

gi|l5011849| 
gi|7243143| 
gi|l745916l| 
gi|ll36029l| 
gi [15237322 | 






2 










As 










* 


* 




1 




ft 










I 






IKAQICSLVE 
IKAQICSLVE 
IKAQICSLVE 
IKAQICSLVE 



HBRRlDPElGLg!3yVD^ 

310 320 330 340 350 




QVjfyG]5}VGEEjELQALTDMP[jFYKTI^^ 
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390 



410 



420 



NOV15 

15011849] 
7243143 | 
1745916l| 
11360291 | 
15237322 j 



gi 

gi 
gi 



NOV15 

gi|l5011849| 
gij 7243143 | 
gi|l7459161 | 
gi [11360291 1 
gi 1 15237322 j 



N0V15 

gijl5011849 | 
gij 7243143 j 
gijl7459161 
1 gijll360291 
gi|l5237322 



N0V15 

gi|l5011849[ 
gij 7243143 | 
gi jl745916lj 
gi jl!36029lj 
gi j 15237322 j 



NOV15 

gi|l5011849| 
gij 7243143 | 
gi j 17459161 1 
gij 11360291 j 
gijl5237322| 



gi 
gi 
gi 
gi 
gi 



150118491 
7243143 | 
17459161 | 
11360291 j 
15237322 j 



N0V15 

gi (15011849 | 
gij 7243143 j 
gi jl7459l61 
gi (11360291 
gi jl5237322 | 



gi 
gi 
gi 
gi 



15011849( 
7243143 | 
17459l6l| 
11360291 | 
15237322 | 



N0V15 
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The Chinese hamster ovary (CHO) cell mutants ldlC and ldjB, which exhibit almost 
identical phenotypes, define two genes required for multiple steps in the normal medial and 
trans Golgi-associated processingof glycoconjugates (Chatterton, JE, et ah, Proc Natl Acad Sci 
5 U S A 1999 Feb 2;96(3):915-20). The LDLC gene encodes ldlCp, an approximately 80-kDa 
protein, which in wild-type, but not ldlB, cells associates reversibly with the cytoplasmic 
surface of the Golgi apparatus. Here, we have used a retrovirus-based expression cloning 
system to clone a murine cDNA, LDLB, that correct? the pleiotropic mutant phenotypes of 
ldlB cells. The corresponding mRNA was not detected in ldlB mutants. LDLB encodes an 

10 approximately 1 10-kDa protein, ldlBp, which lacks homology to known proteins and contains 
no common structural motifs. Database searches identified short segments of homology to 
sequences from Drosophila melanogaster, Arabidopsis thaliana, and Caenorhabditis elegans, 
and the essentially full-length homologous human sequence (82% identity); however, as was 
the casefor ldlCp, no homologue was identified in Saccharomyces cerevisiae. We have found 

15 that in wild-type cell cytosols, ldlCp is a component of an approximately 950-kDa "ldlCp 

complex," which is smaller, approximately 700 kDa, in ldlB cytosols. Normal assembly of this 
complex is ldlBp-dependent and may be required for Golgi association of ldlCp and for the 
normal activities of multiple luminal Golgi processes. Mutations in the LDL receptor (LDLR) 
gene on chromosome 19 cause this disorder. 

20 Familial hypercholesterolemia is characterized by elevation of serum cholesterol bound 

to low density lipoprotein (LDL) and is, hence, one of the conditions producing the 
hyperlipoproteinemia II phenotype. Heterozygotes develop tendinous xanthomas, corneal 
arcus, and coronary artery disease; the last usually becomes evident in the fourth or fifth 
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decade. Homozygotes develop these features at an accelerated rate in addition to planar 
xanthomas, which may be evident at birth in the web between the first 2 digits. 

The ranges of serum cholesterol' and LDL-chdlesterol are, in mg per dl, 250-450 and 
200-400 in heterozygotes, greater than 500 and greater than 450 in homozygous affecteds, and 
5 150-250 and 75-175 in homozygous unaffecteds, with some positive correlation with age 
(Khachadurian, Am, J. Med. 37: 402-407, J964;.Kwiterovich et aJ r , J. Clin. Invest. 53: 1237- 
1249, 1974). Houlston et al. ((Letter) Lancet II: 405 only,1988) studied the relationship of 
lipoprotein(a) levels and coronary heart disease in patients with familial hypercholesterolemia. 
Individuals with coronary artery 1 disease had a significantly higher mean lip oprotein(a) 

10 concentration than those without coronary heart disease, suggesting that lipoprotein(a) 

measurements may help predict the risk of coronary heart disease in individuals with familial 
• hypercholesterolemia. By studies of cultured fibroblasts from homozygotes, Goldstein and 
Brown (Proc. Nat. Acad. Sci. 70: 2804-2808,1973) and Brown and Goldstein (Proc. Nat. 
Acad. Sci. 71: 788-792,1974) showed that the basic defect concerns the cell membrane 

15 receptor for LDL. Normally, LDL is bound at the cell membrane and taken into the cell ending 
- up in lysosomes where the protein is degraded and the cholesterol is made available for 

repression of microsomal enzyme 3-hydroxy-3-methylglutaryl coenzyme A (HMG CoA) 
reductase, the rate-limiting step in cholesterol synthesis. At the ame time, a reciprocal 
stimulation of cholesterol ester synthesis takes place. The same workers found that both 

20 receptor-absent and receptor-defective mutants occur and they concluded that some of the 

'homozygotes' are in fact genetic compounds. An internalization mutant of the LDL receptor 
binds LDL but is unable to facilitate passage of LDL to the inside of the cell (Goldstein et al, 
Cell 12: 629-641,1977). A patient was found to be a genetic compound, having inherited the 
internalization mutant from the father and the binding mutant from the mother. From the fact 

25 that an individual was shown by family studies to be a genetic compound and that 

complementation did not occur, Goldstein et al. (1977) concluded that the gene for binding of 
LDL and the gene for internalization of LDL are allelic mutations at the structural locus for 
the LDL receptor. Miyake et al. (Proc. Nat. Acad. Sci. 78:5151-5155,1981) found 
homozygosity for the internalization defect. Harders-Spengel et al. (Proc. Nat. Acad. Sci. 79: 

30 6355-6359,1982) presented evidence that the receptor defect is present on liver membranes. 
Levy et al. (Proc. Nat. Acad. Sci. 79: 6355-6359,1986) reported 2 brothers with a unique 
genetic compound form of 'homozygous' hypercholesterolemia in which the mother had 
typical FHC and the father and 3 of his close relatives had what they termed the HMWR (high 
molecular weight receptor) trait. In these persons 2 types of functional LDL receptors were 
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found in cultured skin fibroblasts: one with normal molecular weight of 140,000 and one with 
molecular weight of 176",000. Curiously and puzzlingly, the compound heterozygotes and the 
regular heterozygotes for 'the HMWR showed increased' cholesterol synthesis. Furiahashi et al. 
(Hum. Genet.'79: 103-108,1988) studied 16 Japanese kindreds with homozygous FHC. Ten 
5 had a receptor-negative form of the disease; 5 had a receptor-defective form; and 1 represented 
an internalization defect. The receptor-defective group, in which functional receptors were 
produced, showed a lower tendency to coronary artery disease than the receptor-negative , 
group. The receptor, is synthesized as a 120-kD glycoprotein precursor that undergoes change 
to a 160-kD mature glycoprotein through the addition, apparently not of carbohydrate, but 

10 covalently of a 40-kD protein. In the 77 homozygotes studied by Goldstein (Personal 

Communication. Dallas, Tex., 1/4/1983), all involved alterations in the structural gene for the 
120-kD precursor (Tolleshaug et al., Cell 30: 715-724, 1982; Hobg6od et al., (Abstract) Clin. 
Res. 31: 478A only, 1983). On the basis of size alone, he could identify 7 different mutations 
affecting the 120-kD precursor. About half of the 77 'homozygotes* are in fact? genetic 

15 compounds. Jensen et al. (Ann. Hum. Genet. 63: 51 1-520,1999) studied 17 families with 

mutations in the LDLR gene as a model in which to test formally for linkage directly between 
an atherogenic genotype and ischemic heart disease or aorto-coronary calcified 
atherosclerosis. In each family, 1 of 3 different mutations was found: the trp66-to-gly 
mutation, the trp23-to-ter mutation, or a 10-kb deletion removing exons 3 through 6 of the 

.5 

20 LDLR gene. Genomic DNA was used to determine these mutations by either enzymatic 
cleavage assays or Southern blotting. The aorto-coronary calcification was significantly 
associated with age and plasma cholesterol. Sex, hypertension, body mass index, and smoking 
were not associated with the aorto-coronary calcification. Nonparametric analysis indicated 
significant linkage of the LDLR locus to aortic (p less than 0.00005) and to aorto-coronary 

25 calcified atherosclerosis (p less than 0.00001). Assuming a dominant mode of inheritance, 
significant linkage was detected for aortic (lod - 3.89) and aorto-coronary calcified 
atherosclerosis (lod = 4.10). Jensen et al. (1999) suggested that the atherogenicity of variations 
in other genes could be assessed by a similar approach. 

Three independent linkage studies, by Ott et al. (Am. J. Hum. Genet. 26: 598- 

30 603,1974), Berg and Heiberg (Cytogenet. Cell Genet. 16: 266-270,1976), and Elston et ah 

(Cytogenet. Cell Genet. 16: 294-297,1976), strongly suggested loose linkage between familial 
hypercholesterolemia and the third component of complement; C3 has been mapped to 
chromosome 19 by somatic cell hybridization. Other studies of somatic hybrid cells suggested 
that the gene(s) for low density lipoprotein receptor may be on chromosome 5 or 21 or both 
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((Abstract) Cytogenet. Cell Genet. 32: 295-296,Maartmann-Moe et al^ 1982). By family 
studies, Berg and Heiberg ((Abstract) Cytogenet. Cell Genet. 25: 136-137,1979) found a lod 
score of 4.0 for linkage with H]Ja at a recombination fraction of 0.14. Donald et al. 
((Abstract) Cytogenet. Cell Genet. 37: 452 only,1984) presented further data on HC-C3 
linkage, bringing the combined male- female lod score to a maximum of 3,79 at theta 0.25. 
Francke et al. (Proc. Nat. Acad. Sci. 81: 2826-2830,1984) assigned the LDL receptor to 
chromosome 19 on the basis of expression studies in hamster-human somatic cell hybrids;. 
[Francke et al. (198*4) suggested that the locus should be designated LDLR for consistency 
with the policy of the Human Gene Mapping Workshops to name loci by the wildtype gene 
product when known.] It is interesting that both the receptor and one of its ligands (APOE) are 
on chromosome 19. C3 and FHC are about 20 cM apart; APOE and C3 are about 15 cM apart. 
FHC is not closely linked to APOE, suggesting that these 2 loci are on opposite sides of C3. 
Transferrin and transferrin receptor are both on chromosome 3. The LDLR gene was 
regionalized to I?pl3.1-pl3.3 by in situ hybridization (Lindgren et al., Proc. Nat. Acad. Sci. 
82: 8567-8571,1985). Judging by the sequence of loci suggested by linkage data (pter-FHC- 
C3--APOE/APOC2), the location of FHC (LDLR) is probably 19pl3.2-pl3.12 and of C3, 
19pl3.2-pl3.1L Humphries et al. (Lancet I: 1003-1005,1985) found a RFLP of the LDL 
receptor gene using the restriction enzyme Pvull. About 30% of persons are heterozygous for 
the polymorphism which is, therefore, useful in family studies and early diagnosis of FHC. 
Leppert et al. (Am. J. Hum. Genet. 39: 300-306, 1986) found tight linkage between a RFLP of 
the LDL receptor gene and dominantly inherited hypercholesterolemia; specifically, no 
exception to cosegregation was found between high-LDL cholesterol phenotype and a unique 
allele at the LDLR locus. The maximum lod score was 7.52 at theta = 0. Li et al. (Nature 335: 
414-417,1988) worked out a PCR method for analyzing DNA sequences in individual diploid 
cells and human sperm. They showed that 2 genetic loci could be coamplified from a single 
spenn, and proposed its use for genetic linkage studies. They analyzed the genotype of single 
sperm derived from an individual heterozygous at the LDLR locus and the HLA-DQ(alpha) 
locus and could show independent assortment. Individual sperm were drawn into a fine plastic 
needle under microscopic observation and delivered to a tube for lysis and amplification. 
Eighty individual spenn were analyzed for the study of independent assortment of LDLR and 
DQA. The method has great promise for fine mapping. Boehnke et al. (Am. J. Hum. Genet. 
45: 21-32,1989) described the experimental design and issues of sample size to be considered 
in the application of the method to the generation of fine-structure human genetic maps. 
Atherogenic lipoprotein phenotype (108725) shows close linkage to the LDLR locus; indeed, 
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the mutation(s) responsible for this phenotype may reside in the LDLR gene rather than in a 
separate, closely situated gene. Vaughan et al. (Genes Chromosomes Cancer 28: 133-137, 
2000) described the cytogenetic analysis of an unusual giant cardiac lipoma that showed . 
myocardial invasion, occurring in a patient with a history of multiple lipomatosis (151900). 
5 Cytogenetic studies of cells derived from the cardiac lipoma demonstrated no abnormalities at 
chromosome 12, but did reveal a t(2;19)(pl3;pl3.2) translocation. FISH analyses assigned the 
pll5-RhoGEF gene (601855) to 19ql3.2-ql3.3 and mapped the LDLR gene to 19pl3.2, 
centromeric to the t(2;19) breakpoint. Thus, Vaughan et al. (2000) concluded that these genes 
axe unlikely to be involved in the translocation found in this cardiac lipoma. 

10 ( Russell et al. (Cell 37: 577-585,1984) demonstrated DNA sequence homology of the 

LDL receptor with the epidermal growth factor receptor (EGF; 131530). Sudhof et al. (Science 
' 228: 815-822,1985) found that the gene for LDL receptor is more than 45 kb long and contains 
1 8 exons, most of which correlate with functional domains previously defined at the protein 
level. Of the 18 exons, 33 encode protein sequences that are homologous to sequences in other 

15 proteins: 5 encode a sequence similar to one in C9 component of complement; 3 encode a 

sequence similar to a repeat sequence in the precursor for EGF and in 3 proteins of the blood 
clotting system— factor IX, factor X, and protein C, and 5 other exons encode nonrepeated 
sequences that are shared only with the EGF precursor. Since the LDL receptor is a mosaic 
protein built up of exons shared with different proteins, it is a member of several supergene 

20 families. Gilbert (Science 228: 823-824,1985) commented on the relevance of these findings 
to understanding the significance of 'split genes' and 'exon shuffling' during evolution. 
Horsthemke et al. (Europ. J. Biochem. 164: 77-81,1987) analyzed DNA from 70 UK patients 
with heterozygous familial hypercholesterolemia. In most, the restriction fragment pattern of 
the LDLR gene was indistinguishable from the normal; however, 3 patients were found to 

25 have a deletion of about 1 kb in the central portion of the gene. In 2 patients, the deletion 

included all or part of exon 5; in the third, the deletion included exon 7. Including a previously 
described patient with a deletion in the 3 -prime part of the gene, these results indicated that 4 
out of 70 patients, or 6%, have deletions. Langlois et al. (J. Hum. Genet. 43: 60-68,1988) 
screened 234 unrelated heterozygotes for FH to detect major rearrangements in the LDLR 

30 gene. Total genomic DNA was analyzed by Southern blot hybridization to probes 

encompassing exons 1 to 18 of the LDLR gene. Six different mutations were detected and 
characterized by use of ex on-specific probes and detailed restriction mapping. The frequency 
of deletions in the Langlois et al. (1988) study was 2.5% (6 out of 234 patients). An 
illustration of previously mapped deletions and the deletions identified in this study (a total of 
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16) suggested that particular areas in the LDLR gene are susceptible to deletion. In a Japanese 
subject with homozygous hypercholesterolemia, Lehrman et ah (Cell 48: 827-835,1987) found 
a 7.8-kb deletion in LDLR. The deletion joined intron 15 to the middle of epcon 18, which 
encodes the 3 -prime untranslated region, thereby removirig all 3-prime splice acceptor sites 
. 5 distal to intron 15. The rnRNA should produce a truncated receptor that lacks the normal 
membrane-COOH terminus. The truncated protein was such that rnore than 90% of the 
receptor was secreted from the cell, and the receptors remaining on the surface showed 

i 

defective internalization. The deletion resulted from recombination between 2 repetitive 
sequences of the Alu family, one in intron 15 and the other in exon 18. Lehrman et al. (1987) 
10 stated that Alu sequences had been found at the deletion joints of all 4 gross deletions that had 
been characterized in LDLR. Because of these and similar findings in connection with 
' deletions in the gamma-delta-beta-globin cluster, recombination between Alu sequences 
appears to be a frequent cause of deletions in the human genome. Horsthemke et al. (1987) 
suggested that unequal crossing-over between 2 Alu-repetitive DNA sequences was 

1 5 responsible for an intragenic deletion of the LDLR gene leading to familial 

hypercholesterolemia. A 4-kb deletion had occurred between an Alu -repetitive sequence in 
intron 12 and a sequence in intron 14. The deletion eliminated exons 13 and 14 and changed 
the reading frame of the resulting spliced rnRNA such that a stop codon was created in the 
following exon. The truncated receptor protein appeared to be rapidly degraded. The deletion 

20 was presumably caused by an unequal crossover event between 2 homologous chromosomes 
at meiosis. Alu sequences are widely scattered in the genome, being present in 300,000 to 
500,000 copies. They have been described, for example, in the genes for alpha-globin (see 
J41800), gastrin (137250), gamma crystalJin (123660), insulin-like growth factor 11(147470), 
and soluble thymidine kinase. Each is about 300 bp long; thus Alu sequences represent about 

25 3% of the total DNA. On the basis of structural similarity, the origin of Alu elements can be 
. traced to the gene for 7SL RNA (Nature 312: 171-172, Ullu and Tschudi, 1984). The abundant 
cytoplasmic 7SL RNA functions in protein secretion as a component of the signal-recognition 
particle. This particle, consisting of 6 different polypeptides and 1 molecule of 7SL RNA, 
mediates the translocation of secretory proteins across the cytoplasmic reticulum. Although 

30 the 7SL RNA has a well-defined biologic function, that of the related Alu repeat remains 

unknown. Thus, the 7SL RNA gene may be a progenitor of a processed pseudogene, the Alu 
element, that has 'recently 1 spread to different locations in the human genome. The average 
Alu family member probably integrated into its present genomic location about 1 5-30 Myr 
ago. The Alu family is specific to primates, suggesting that these repeats were not present as 
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little as 65 Myr ago. According to the Alu family copy number, one would, on the average, 
expect to rind 1 such repeat every 3 to 5 kb in the human genome if they are randomly 
distributed. However,' studies of the albumin/alpha- fetoprotein family by Rifffher et al. (Molec. 
Biol Evol. 4: 1-9,1987) and of the thymidine kinase (1 88300) and beta-tubulin genes by 
5 Slagel et al. (Molec. Biol. Evol. 4: 19-29,1987) indicate clustering of Alu repeats in some parts 
of the genome. For example, the beta-tubulin gene has 10 of these repeats in less, than 5 kb of a 
single intron and the thymidine kinase has'13 members' within its introns in a region of about 
10 kb. Hobbs et al. (New Eng. J. Med. 317: 734-737,1987) found a large deletion (more than 
10 kb) in the LDLR gene' in 63% of French Canadians with heterozygous FH. The deletion 

1 0 also occurred in homozygous form in 4 of 7 French-Canadian homozygotes. The deletion 

removed the promoter and first ex on of the gene and abolished the production of mRNA for 
LDL receptor. The high frequency of the mutation was interpreted is representing founder 
effect; 8,000 ancestors account for, present-day French Canadians and there has been relatively 
little outbreeding. The deletion has not been observed in any other ethnic groufp. It can be 

15 detected by analysis of genomic DNA frorri blood leukocytes, thus allowing direct diagnosis 
of FH in most affected French Canadians. Ma et al. (Clin. Genet. 36: 219-228,1989) identified 
a second 'French Canadian* LDLR gene deletion which was found in 4 of 80 heterozygotes 
(5%). The mutation consisted of a 5-kb deletion removing exons 2 and 3 of the LDLR gene, 
which corresponded to the first 2 repeats of the LDLR-binding domain. Hobbs et al. (J. Clin. 

20 Invest. 81 : 909-917,1988) found that 16 of 132 cell strains (12%) from persons with the 
clinical syndrome of homozygous familial hypercholesterolemia synthesized no 
immunodetectable LDL receptor protein, indicating the presence of 2 mutant genes that failed 
to produce crossreacting material (CRM-negative mutants). DNA and mRNA from all but one 
of these CRM-negative patients were available for study. Haplotype analysis based on 10 

25 RFLPs suggested that the 30 CRM -negative genes represented by these 1 5 individuals had 

included 13 different mutant alleles. Four of the alleles produced no mRNA; 3 of these 4 had 
large deletions ranging from 6 to 20 kb that eliminated the promoter region of the gene. The 
reason for the lack of mRNA in the fourth allele was not apparent. Three alleles encoded 
mRXAs of abnormal size. One of the abnormal mRNAs was produced by a gene harboring a 

30 deletion, and another was produced by a gene with a complex rearrangement. The third 
abnormal-sized mRNA (3.1 kb larger than normal) was produced by an allele that had no 
detectable alteration as judged by Southern blotting. The other 6 rnRNA-positive alleles 
appeared normal by Southern blotting and produced normal-sized mRNA but no receptor 
protein. Among 20 mutant LDL receptor genes, Yamakawa et al. (Hum. Genet. 82: 317- 
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321,1989) found 4 different deletion mutations (20%). None of these had been reported 

i » ** i' » 1 • 

previously in Caucasians. Three of them were novel and one was similar to a previously 

described Japanese mutation. In '3 of the 4 deletions, the : rearrangements were related to intron 

15 in which there are many Alu sequences. Leitersdorf et al. (J. Clin. Invest. 85: 1014- 

5 1023,1990) analyzed the LDL receptor genes of 1 1 French-Canadian FH homozygotes. Only 3 

different LDLR haplotypes were identified, and the codingregion of the allele associated with 

each was sequenced. Three different missense mutatioris were found. Assays developed to 

detect each of these directly were applied to 130 FH heterozygotes from the greater Montreal 

area. The common deletion responsible for about 60% of cases (Hobbs et al., 1987) and the 

10 smaller deletion identified by Ma et al. (1989) and found in about 5% of French Canadians 

were also sought. They were able to detect LDL receptor mutations in 76% of the subjects and 

14% had 1 of the 3 missense mutations. In the Saguenay-Lac St. Jean region of Quebec 

province, De Braekeleer (Hum. Hered. 41 : 141-146,1991) estimated the prevalence of familial 

hypercholesterolemia as 1/122, compared to the usually used frequency of 1/300 for European 

15 populations.. Yamakawa et al. (Hum. Genet. 80: 1-5,1988) described a TaqI polymorphism in 
the LDLR gene which should be useful in the study of FH. Leitersdorf et al. (Am. J. Hum. 
Genet. 44: 409-421,1989) used 10 different RFLPs to construct 123 differing haplotypes from 
20 pedigrees. The 5 most common haplotypes accounted for 67.5% of the sample. 
LI eterozygosity and polymorphism information content (PIC) for each site were determined. 

20 Hobbs et al. (Annu. Rev. Genet. 24: 133-170,1990) reviewed the many mutations found in the 
LDLR gene. Rudiger et al. (Europ. J. Biochem. 198: 107-11 1,1991) reviewed previously 
described deletions in the LDLR gene in cases of familial hypercholesterolemia and reported 
the finding of a deletion in 3 of 25 unrelated patients with FH. Two of these were equivalent to 
previously described LDLR alterations, thus supporting a notion of recombination hotspots 

25 which involve Alu sequences. In at least 4 cases (FH626, PO, JA, and FH-DK3), a deletion of 
ex on 5 of the LDLR gene has been found as the defect responsible for FH. The FH626 
mutation was characterized by Hobbs et al. (J. Biol. Chem. 261 : 131 14-13120,1986) and 
found to involve Alu repeat sequences in introns 4 and 5. Rudiger et al. (1991) characterized 
FH-DK3 and likewise found involvement of 2 Alu repeated sequences present in introns 4 and 

30 5. The crossover breakpoints involve sequences similar to those reported for FH626 but not at 
identical positions in the 5-prime end. By use of denaturing gradient gel electrophoresis 
(DGGE) in combination with PCR, Top et al. (Hum. Genet. 89: 561-565,1992) found no 
evidence of a promoter mutation in the LDLR gene in 350 heterozygotes for FH. tlobbs et al. 
(Hum. Mutat. 1 : 445-466,1 992) reviewed 71 mutations in the LDL receptor gene that had been 
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characterized at the molecular level and added 79 additional mutations. Furthermore, they 
reviewed the insight that all 1 50 mutations provided into the structirre/function relationship of 
the receptor protein and the clinical manifestations of FH. WilSon' et ah (Am, J. Cardiol. .81 : 
1509-1511,1998) described an online database of LDLR mutations. Lee et al. (Am. J. Cardiol. 
81: 1509-1511,1998) studied 80 unrelated individuals with FH from the West of Scotland. 
Microsatellite analysis using Dl 9S394 was informative in 20 of 23 families studied. In these 
families, there was no inconsistency with segregation of the FH phenotype with the LDLR 
locus. Using SSCP, Lee et al. (J. Med. Genet. 35: 573-578,1998) also detected mutations in 
exon 4 of the LDLR gene in 15 bf 80 of these individuals; 7 of 15 had the same cys!63-to-tyr 
mutation (143890.0058). Lee et al.. (1998) concluded that microsatellite analysis using 
D19S394 is useful in tracking the LDLR gene in families and could be used in conjunction 
with LDL cholesterol levels to diagnose FH, especially in children and young adults, in whom 
phenotypic diagnosis can be difficult. Knoblauch et al. (Am. J. Hum. Genet. 66: 157- 
166,2000) studied an Arab family that earned the tyr807-to-cys substitution (143890.0019). In 
this family, some heterozygous persons had normal LDL levpls, while some homozygous 
individuals had LDL levels similar to those persons with heterozygous FH. The authors 
presented evidence for the existence of a cholesterol-lowering gene on 13q (604595). 

Davis et al. (Cell 45: 15-24,1986) found that substitution of cysteine for tyrosine in the 
cytoplasmic domain of LDL receptors impedes their internalization. Lehxman et al. (1985) 
found that genes with nonsense and frameshift mutations that truncated the cytoplasmic 
domain resulted in internalization-defective LDL receptors. Hobbs et al. (1986) described an 
LDL receptor mutant in which 1 of the 7 repeating units constituting the ligand binding 
domain had been deleted. The deletion arose by homologous recombination by repetitive Alu 
sequences in intron 4 and intron 5 of the gene. The deletion removed exon 5, which normally 
encodes the sixth repeat of the ligand binding domain. In the resultant mRNA, exon 4 was 
found to be spliced to exon 6, preserving the reading frame. The resulting shortened protein 
reaches the cell surface and reacts with antireceptor antibodies but does not bind LDL. It does, 
however, bind VLDL, a lipoprotein that contains apoprotein E as well as apoprotein B-100. 
The findings in this instructive case support the hypothesis that the 7 repeated sequences in the 
receptor constitute the LDL binding domain, that the sixth repeat is required for binding of 
LDL but not of VLDL, and that deletion of a single repeat can alter the binding specificity of 
the LDL receptor. Sass et al. (Hum. Genet. 96: 21-26,1995) described a 4-generation French- 
Canadian kindred with familial hypercholesterolemia in which 2 of the 8 heterozygotes for a 
5-kb deletion (exons 2 and 3) in the LDLR gene were found to have normal LDL-cholesterol 
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levels. Analyses showed that it was unlikely that variation in the genes encoding 
apolipoprotein B, HMG-CoA reductase, apoAi-CIII-AIV, or lipoprotein lipase was 
responsible for the cholesterol -lowering effect Expression of the LDL receptor, as assessed in 
vitro with measurements of activity and mRNA levels, was similar in normolipidemic and 
5 hyperlipidemic subjects carrying the deletion. Analysis of the apoE isoforms, on the other 
hand, revealed that most of the E2 allele carriers in this family, including the 2 
normolipidemic 5-kb deletion carriers, had LDL cholesterol levels substantially lower than 
subjects with the other apoE isoforms. Thus, this kindred provided evidence for the existence 
of a gene or genes, including the apoE2 allele, with profound effects on LDL-cholesterol 
10 levels. In a 13-year-old girl with severe hypercholesterolemia, Ekstrom et al. (Clin. Genet. 55: 
332-339,1999) demonstrated compound heterozygosity for a cys240-to-phe mutation 
(143890.0059) and a tyrl67-to-ter mutation (143890.0045) in the Lt>LR gene. Her 2 
heterozygous sibs also carried the ,C240F mutation, but only one of them was 
hypercholesterolemic. The authors concluded that there may be cholesterol- lowering 

15 mechanisms that are activated by mutations in other genes. 

Feussner et al. (Am. J. Med. Genet. 65: 149-154,1996) described a 20-year-old man 
with a combination of heterozygous FH caused by splice mutation (143890.0054) and type III 
hyperlipoproteinemia (107741). He presented with multiple xanthomas of the elbows, 
interphalangeal joints and interdigital webs of the hands. Active lipid- lowering therapy caused 

20 regression of the xanthomas and significant decrease of cholesterol and triglycerides. Flat 

xanthomas of the interdigital webs were described in 3 of 4. formerly reported patients with a 
combination of these disorders of lipoprotein metabolism. Feussner et al. (1996) proposed that 
the presence of these xanthomas should suggest compound heterozygosity (actually double 
heterozygosity) for FH and type III hyperlipoproteinemia. Varret et al. (Nucleic Acids Res. 25: 

25 172-180,1997) described a database of LDLR genes and provided a listing of the 210 

mutations it contained as of the fall of 1996. Vergopoulos et al. (Europ. J. Hum. Genet. 5:315- 
323, 1997) presented findings suggesting the existence of a xanthomatosis-susceptibility gene 
in a consanguineous Syrian kindred containing 6 individuals with homozygous FH (see 
602247). Half of the homozygotes had giant xanthomas, while half did not, even though their 

30 LDL-cholesterol concentrations were elevated to similar degrees (more than 14 mmol/1). 

Heterozygous FH individuals in this family were also clearly distinguishable with respect to 
xanthoma size. By DNA analysis they identified a hitherto imdescribed mutation in the LDLR 
gene in this family: a T-to-C transition at nucleotide 1999 in codon 646 of exon 14, resulting 
in an arginine for cysteine substitution. Segregation analysis suggested that a separate 
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susceptibility gene may explain the formation of giant xanthomas. In homozygous familial 
hypercholesterolemia, the aortic root is prone to develop therosclerotic plaque at an early age.' 
Such plaques can accumulate in unusual sites, such as the ascending aorta and around the. 
coronary ostia. Summers et al. (Circulation 98: 509-518,1998) evaluated the aortic root using 
5 MRJ imaging in a blinded, prospective study of 17 homozygous FH patients and 12 healthy 
controls. When patient age and body mass index, were taken into account, 53% of patients with 
homozygous FH had increased aortic wall thickness compared 1 to controls; this was thought to 
result from a combination of medial hyperplasia and plaque formation. Supravalvular aortic 
stenosis was seen in 41% of patients. 

10 1 Hepatitis C virus (HCV), the principal viral cause of chronic hepatitis, is not readily 

replicated in cell culture systems, making it difficult to ascertain information on cell receptors 
for the virus. However, several obseivations from studies on the role of HCV in mixed 
cryoglobulinemia provided some insight into HCV entry into cells. Evidence indicated that 
HCV and other viruses enter cells through the mediation of LDL receptors: by the 

15 demonstration that endocytosis of these viruses correlates with LDL receptor activity, by 
complete inhibition of detectable endocytosis by anti-LDL receptor antibody, by inhibition 
with anti-apolipoprotein E and anti-apolipoprotein B antibodies, by chemical methods 
abrogating lipoprotein/LDL receptor interactions, and by inhibition with the endocytosis 
inhibitor phenylarsine oxide. Agnello et al. (Proc. Nat. Acad. Sci. 96: 12766-12771,1999) 

20 provided confirmatory evidence by the lack of detectable LDL receptor on cells known to be 
resistant to infection by one of these viruses, bovine viral diarrheal virus (BVDV). 
Endocytosis via the LDL receptor was shown to be mediated by complexing of the virus to 
very low density lipoprotein (VLDL) or LDL, but not high density lipoprotein (HDL). Studies 
using LDL receptor-deficient cells or a cytolytic BVDV system indicated that the LDL 

25 receptor may be the main but not exclusive means of cell entry of these viruses. 

Compactin is a potent competitive inhibitor of 3 -hydroxy- 3 -methyl glutaryl coenzyme- 
A reductase and may prove useful in the treatment of hypercholesterolemia (Betteridge et 
al.,Lancet II: 1342-1343 1978). Starzl et al. (Lancet I: 1382~1383„1984) performed both heart 
transplant and liver transplant in a 6.75-year-old girl with homozygous familial 

30 hypercholesterolemia. 

Wilson et al. (Hum. Gene Therapy 3: 179-222,1992) presented a detailed clinical 
protocol for the ex vivo gene therapy of familial hypercholesterolemia. The approach, which 
they proposed to use to treat homozygous FH patients with symptomatic coronary artery 
disease who have a relatively poor prognosis but can tolerate a noncardiac surgical procedure 
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with acceptable risks, involves recovery of hepatocytes from the patient and reimplanting them 
after genetic correction by a retro virus-mediated gene transfer. Not only were the technical 
details of vectors and viruses, transduction and delivery of hep&tocytes, evaluation of 
engraftment and rejection, etc., discussed, but also assessment of risks versus benefits and 
5 informed consent for both adult and child patients. 

Tonstad et al. (J. Pediat. 129: 42^-49,1996) conducted a double-blind placebo- 
controlled trial over 1 year using 8 grams of cholestyramine hvprepubertal children (aged 6-11 
years) with familial hypercholesterolemia. After 1 year of a low-fat, low-cholesterol diet, 
children with a family history of* premature cardiovascular disease had LDL cholesterol levels 
1 0 at or greater than 4.9 mmol/liter, while children without such a family history had LDL 

cholesterol levels at or greater than 4.1 mmol/liter. The LDL cholesterol levels in the test group 
1 lowered by 1 6,9% (95% confidence interval), compared with a 1 .4% increase in the placebo 
group. 

Growth velocity was not adversely affected in the treatment group, although folate 

1 5 and 25-hydroxyvitamin D deficiency were noted among a small number of treated children. 
Additionally, a boy who had an appendectomy 3 months before the study required surgery for 
intestinal obstruction after he had taken the first 2 cholestyramine doses. Given the number of 
gastrointestinal side effects, Tonstad et al. (1996) recommended caution in starting 
cholestyramine after abdominal surgery in children. 

20 In most populations the frequency of the homozygote is 1 in a million (probably a 

minimal estimate, being a prevalence figure rather than incidence at birth) and the frequency 
of heterozygotes not less than 1 in 500. Thus, heterozygous familial hypercholesterolemia is 
the most frequent mendelian disorder, being more frequent than either cystic fibrosis or sickle 
cell anemia which, in different populations, are often given that distinction. Among survivors 

25 of myocardial infarction, the frequency of heterozygotes is about 1 in 20. 

Seftel et al. (Brit. Med. J. 281: 633-636,1980) pointed to a high frequency of 
hypercholesterolemic homozygotes in South Africa. In a 7-year period, 34 homozygotes were 
seen in one clinic in Johannesburg. All were Afrikaners and most lived in Transvaal Province. 
The authors calculated the frequency of heterozygotes and homozygotes to be 1 in 100 and 1 

30 in 30,000, respectively. The oldest of their patients was a 46-year-old woman. Of the 34, six 
were age 30 or older. The authors concluded that the high frequency of the gene is attributable 
to founder effect, as in the case of porphyria variegata, lipoid proteinosis, and sclerosteosis. 
Torrington and Botha (Lancet II: 1 120 only, 1981) found that 20 of 26 families with FHC 
(77%) belonged to the Gereformeerde Kerk, whereas according to the 1970 census only 5% of 
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the Afrikaans-speaking white population of South Africa belonged to this religious group. 
Again, the data were consistent with a founder effect. Using the LDLR activity of 
lymphocytes, Steyn et aL (J. Med. Genet. 26: 32-36,1989) calculated the prevalence of 
heterozygous fHC in the permanent residents of a predominantly Afrikaans-speaking 
community in South Africa to be 1 in 71 -the highest prevalence reported to date. 

Like the French Canadians, the South Afrikaners appear to have a unique form of 
mutation in the LDLR gene consistent with founder effect (Brink et al., Hum. Genet. 77: 32- 
35,1987). Because of the presumed role of founder effect on the high frequency of familial 
hypercholesterolemia in South Africa, it is not surprising that Kotze et al. (J. Med. Genet. 24: 
750-755,1987) found a predominance of 2 haplotypes in 27 informative families with FH. In a 
study of homozygotes from the Afrikaner population in South Africa, Leitersdorf et al. (1989) 
found that 2 mutations account for more than 95% of the mutant LDL receptor genes. Both 
mutations were basepair substitutions that resulted in a single amino acid change and both 
could be detected readily with PCR and restriction analysis. The findings were considered 
consistent with the high frequency of FH being due to founder effect. Graadt van Roggen et al. 
(Hum. Genet. 88: 204-208,1991) studied the prevalence and distribution of the 3 common 
mutations in South Africa in 27 unrelated homozygous and 79 unrelated heterozygous FH 
Afrikaner patients from 2 regions of South Africa, the Transvaal and Cape Provinces. The 3 
mutations were FH Afrikaner- 1 (143890.0006), FH Afrikaner-2 (143890.0009), and FH 
Afrikaner-3 (143890.0044). 

The relative distribution of each of the 3 mutations was similar in the 2 regions, 
with frequencies of 66, 27, and 7%, respectively. Defects other than the 3 common mutations 
were more frequent in the Cape than in the Transvaal; thus, the 3 known mutations accounted 
for 98% of FH alleles in the Transvaal nd only 74% in the Cape Province. None of the patients 
carried the familial apolipoprotein B-100 mutation. 

Schuster et al. (Clin. Genet. 48: 90-92, 1995) identified yet another homozygote for 
the val408-to-met mutation (143890.0009), a 12-year-old Greek boy living in Germany. The 
mutation was present in both his parents, his brother, grandmother, uncle, and cousin. The 
haplotype, using 6 RFLPs of the LDL receptor gene, was different from the one reported 
earlier in Afrikaner and Dutch FH patients. Schuster et al. (1995) concluded that the mutation 
in the Greek boy probably occurred independently. Furthermore, they speculated that, because 
the parents were from different areas in Greece, the mutation may be common in Greeks. 
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In' the Saguenay-Lac St. Jean region of Quebec province, De Braekeleer (1991) 
estimated the prevalence of familial hypercholesterolemia as 1/122, compared to the usually 
used frequency of 1/500 for European populations. 

Defesche and Kastelein (Lancet 352: 1 643-1 644,1998) stated that more than 350 
5 different mutations had been found in patients with familial hypercholesterolemia. They 

tabulated the preferential geographic distribution that has been demonstrated for some of the 
LDL receptor mutations. For example, in the West of Scotland about ' half of the index cases 

of FH were found to have the cysl 63 - to- tyr mutation (143890.0058). Defesche and Kastelein 

» 

(1998) commented on the geographic associations of LDL receptor mutations within the 

10 Netherlands. 

Kingsley and Krieger (Proc. Nat. Acad. Sci. 81: 5454-5458,1984) identified 4 
different types of mutant Chinese hamster ovary cells with defective LDL receptor function. 
One locus, called ldlA, apparently represents the structural gene for LDL receptor, whereas the 
others— IdlB, IdlQ, and IdlD— appear to have defects involved in either regulation, synthesis, 

1 5 transport, recycling, or turnover of LDL receptors. 

The Watanabe heritable hyperlipidemic (WHHL) rabbit has a genetic deficiency of 
LDL receptors and is therefore a superb experimental model (Hornick et ah, Proc. Nat. Acad. 
Sci. 80: 6096-6100, 1983). Kite et al. (Proc. Nat. Acad. Sci. 84: 5928-5931,1987) found that 
probucol prevented the progression of atherosclerosis in the Watanabe rabbit by limiting 

20 oxidative LDL modification and foam cell transformation of macrophages. Probucol was 

originally developed as an antioxidant. Yamamoto et al. (1986) showed that the defect in the 
Watanabe heritable hyperlipidemic rabbit is a mutant receptor for LDL that is not transported 
to the cell surface at a normal rate. Cloning and sequencing of complementary cDNAs from 
normal and Watanabe rabbits showed that the defect arises from an in- frame deletion of 12 

25 nucleotides that eliminates 4 amino acids from the cysteine-rich ligand binding domain of the 
LDL receptor. Yamamoto et al. (1986) detected a similar mutation by Si nuclease mapping of 
LDL receptor nxRNA from a patient with familial hypercholesterolemia whose receptor also 
failed to be transported to the cell surface. These findings suggested to Yamamoto et al. 
(Science 232: 1230-1237,1986) that animal cells may have fail-safe mechanisms that prevent 

30 surface expression of improperly folded proteins with unpaired or improperly bonded cysteine 
residues. 

Scanu et al. (J. Lipid Res. 29: 1671-1681,1988) investigated hypercholesterolemia 
due to deficiency of the LDL receptor in a family of rhesus monkeys. Hummel et al. (Proc. 
Nat. Acad. Sci. 87: 3122-3126,1990) used PCR to analyze the mutation carried by members of 
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a family of rhesus monkeys with spontaneous hypercholesterolemia and low density 
lipoprotein receptor deficiency. Affected monkeys are heterozygous for a nonsense mutation * 
in ex on 6, changing codon 284 from T<5g to TAG. The G-to-A transition creates a new Spel 
restriction site. LDLR RNA is reduced by about 50% on Quantitative analysis of RNA 
5 obtained at liver biopsy in affected animals. 

Hofmann et al. (Science 239: 1277-1281,1988) found that overexpression of LDL 
receptors caused elimination of both apoE and apoB, the 2 ligands, from the plasma in 
transgenic mice derived from fertilized eggs injected with the LDLR gene under control of the 
mouse metallothionein-I promoier. They speculated that overexpression of other receptors, 

10 such as those for insulin (147670) or transferrin (1 90000), might have pathologic effects 
leading to a 'ligand steal' syndrome. 
1 ' Frank et al. (Genomics 5: 646-648,1989) identified RFLPs of the mouse LDL 

receptor gene and used them to map the gene, designated Ldlr, to the proximal region of 
chromosome 9. Using interspecific backcrosses, they established the order and interval 

15 distances for this and several other loci on mouse chromosome 9, namely, APOA4 (107690), 
which is on chromosome 1 1 in man, and mannosephosphate isomerase (154550), which is on 
chromosome 1 5 in man. 

Roy Chowdhury et al. (Science 254: 1802-1805,1991) used the Watanabe rabbit for 
the development of liver-directed gene therapy based on transplantation of autologous 

20 hepatocytes that had been genetically corrected ex vivo with recombinant retroviruses. 

Animals transplanted with LDLR-transduced autologous hepatocytes demonstrated a 30 to 
50% decrease in total serum cholesterol that persisted for the duration of the experiment (122 
days). Recombinant-derived LDLR RNA was harvested from tissues with no diminution for 
up to 6.5 months after transplantation. Ishibashi et al. (J. Clin. Invest. 92: 883-893,1993) 

25 developed a new animal model for homozygous FH through targeted disruption of the LDLR 
gene in mice. Homozygous LDL receptor-deficient mice showed delayed clearance of VLDL, 
intermediate density lipoproteins (DDL), and LDL from plasma. As a result, total plasma 
cholesterol level rose from 108 mg/dl in wildtype mice to 236 mg/dl in homozygous deficient 
mice. Adult mice did not exhibit gross evidence of xanthomatosis, however, and the extent of 

30 aortic atherosclerosis was minimal. On the other hand, Ishibashi et al. (J. Clin. Invest. 93: 

1885-1893,1994) showed that in mice homozygous for the targeted disiiiption of the LDLR 
gene who were fed a diet high in cholesterol, total plasma cholesterol rose from 246 to more 
than 1,500 mg/dl. In wildtype littermates fed the same diet, total plasma cholesterol remained 
less than 160 mg/dl. After 7 months, the homozygous deficient mice developed massive 
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xanthomatous infiltration of the skin and subcutaneous tissue. The aorta and coronary ostia 
" exhibited gross atheromata, and the aortic valve leaflets were thickened by cholesterol-laden . 
macrophages. ' '. 

The above defined information for NOV15 suggests that this NOV15 protein may 
5 function as a member of a Low Density Lipoprotein Receptor (LDLR) protein family. 
Therefore, the NOV15 nucleic acids and proteins of the invention are useful in potential 
therapeutic and diagnostic applications. For example, a cDNA encoding the NOV15 protein 
may be useful in gene therapy, and the NOV15 protein may be useful when administered to a 
subject in need thereof. By way of nonlimiting example, the compositions of the present 
10 'invention will have efficacy for treatment of patients suffering from Familial 

hypercholesterolemia; hyperlipoproteinemia II phenotype; tendinous xanthomas; corneal 
1 arcus; coronary artery disease; planar xanthomas; webbed digits; hypercholesterolemia; 
fertility; coronary artery disease; diabeties; atherosclerosis; xanthomatosis; Hepatitis C 
infection; regulation, synthesis, transport, recycling, or turnover of LDL receptors; Cerebral 
15 arteriopathy with subcortical infarcts and leukoencephalopatfiy; Epiphyseal dysplasia, multiple 
1 ; Ichthyosis, nonlamellar and nonerythrodermic, congenital; Leukemia, T-cell acute 
lymphoblastoid; Pseudoachondroplasia; SCED, autosomal recessive, T-negative/B-positive 
type; C3 deficiency; Diabetes mellitus, insulin-resistant, with acanthosis nigricans; 
Glutaricaciduria, type I; Hypothyroidism, congenital; Leprechaunism; Liposarcoma; 
20 Mucolipidosis IV; Persistent Mullerian duct syndrome, type I; Rabson-Mendenhall syndrome; 
Thyroid carcinoma, nonmedullary, with cell oxyphilia; Erythrocytosis, familial; Malaria, 
cerebral, susceptibility to; Bleeding disorder due to defective thromboxane A2 receptor; 
Cerebellar ataxia, Cayman type; Convulsions, familial febrile, 2; Cyclic hem atop oiesis; 
Fucosylrransferase-6 deficiency; GAMT deficiency; Von Hippel-Lindau (VHL) syndrome; 
25 Cirrhosis;' Transplantation; Psoriasis; Actinic keratosis; Tuberous sclerosis; Acne; Hair 

growth; allopecia; pigmentation disorders and/or endocrine disorders. The NOV15 nucleic 
acid encoding Low Density Lipoprotein Receptor (LDLR)-like protein, and the Low Density 
Lipoprotein Receptor (LDLR)-like protein of the invention, or fragments thereof, may further 
be useful in diagnostic applications, wherein the presence or amount of the nucleic acid or the 
30 protein are to be assessed. 
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NOVl 6 

NOVl 6 includes a Tumor necrosis factor receptor (TNFR)-like protein and a TNF 
Receptor Asspciated Factor 5 (TRAF 5)-like disclosed below. The disclosed proteins have 
been named NOV 16a and NOV 16b, respectfully. 

NOVl 6a 

A disclosed NOVl 6a nucleic acid 'of 1787 nucleotides (designated CuraGen Abe. No. 
CG56108-01) encoding a novel Tumor necrosis factor receptor (TNFR)-like protein is shown 
in Table 16 A. 1 An open reading frame was identified beginning with an ATG initiation codon 
at nucleotides 7-9 and ending with a TAG codon at nucleotides 1711-1713. Putative 
untranslated regions upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 1 6A, and the start and stop codons a^e in bold letters. 



Table 16A. NOyi6a Nucleotide Sequence (SEQ ID NO:37) 



CCCACAA TGGCTTATTCAGAAGAGCATAAAGGTATGCCCTGTGGTTTCATCCGCCAGAATTCCGGCAACTCCATTTCCT 
TGGACTTTGAGCCCAGTATAGAGTACCAGTTTGTGGAGCGGTTGGAAGAGCGCTACAAATGTGCCTTCTGCCACTCGGT 
GCTTCACAACCCCCACCAGACAGGATGTGGGCACCGCXTCTGCCAGCACTGCATCCTGTCCCTGAGAGAATTAAACACA 
G TG CCAATC TG CCCTGTAG ATAAAG AGGTCATCAAATCTCAGGAGG TTTTTAAAG ACAATTGTTGCAAAAGAGAAG TCC 
TCAACTTATATGTATATTGCAGCAATGCTCCTGGATGTAATGCCAAGGTTATTCTGGGCCGGTACCAGCAGGTCCCACT 
GGCCTGTTGTTATCTGTTGCAGGATCACCTTCAGCAGTGCTTATTTCAACCTGTGCAGTGTTCTAATGAGAAGTGCCGG 
GAGCCAGTCCTACGGAAAGACCTGAAAGAGCATTTGAGTGCATCCTGTCAGTTTCGAAAGGAAAAATGCCTTTATTGCA 
AAAAGGATGTGGTAGTCATCAATCTACAGAATCATGAGGAAAACTTGTGTCCTGAATACCCAGTATTTTGTCCCAACAA 
TTGTGCGAAGATTATTCTAAAAACTGAGGTAGATGAACACCTGGCTGTATGTCCTGAAGCTGAGCAAGACTGTCCTTTT 
AAGCACTATGGCTGTGCTGTAACGGATAAACGGAGGAACCTGCAGCAACATGAGeATTCAGCCTTACGGGAGCACATGC 
GTTTGGTTTTAGAAAAGAATGTCCAATTAGAAGAACAGATTTCTGACTTACACAAGAGCCTAGAACAGAAAGAAAGTAA 
AATCCAGCAGCTAGCAGAAACTATAAAGAAACTTGAAAAGGAGTTCAAGCAGTTTGCACAGTTGTTTGGCAAAAATGGA 
AGCTTCCTCCCAAACATCCAGGTTTTTGCCAGTCACATTGACAAGTCAGCTTGGCTAGAAGCTCAAGTGCATCAATTAT 
TACAAATGGTTAACCAGCAACAAAATAAATTTGACCTGAGACCTTTGATGGAAGCAGTTGATACAGTGAAACAGAAAAT 
TACCCTGCTAGAAAACAATGATCAAAGATTAGCCGTTTTAGAAGAGGAAACTAACAAACATGATACCCACATTAATATT 
CATAAAGCACAGCTGAGTAAAAATGAAGAGCGATTTAAACTGCTGGAGGGTACTTGCTATAATGGAAAGCTCATTTGGA 
AGGTGACAGATTACAAGATGAAGAAGAGAGAGGCGGTGGATGGGCACACAGTGTCCATCTTCAGCCAGTCCTTCTACAC 
CAGCCGCTGTGGCTACCGGCTCTGTGCTAGAGCATACCTGAATGGGGATGGGTCAGG'GAGGGGGTCACACCTGTCCCTA 
TACTTTGTGGTCATGCGAGGAGAGTTTGACTCACTGTTGCAGTGGCCATTCAGGCAGAGGGTGACCCTGATGCTTCTGG 
ACCAGAGTGGCAAAAAGAACATTATGGAGACCTTCAAACCTGACCCCAATAGCAGCAGCTTTAAAAGACCTGATGGGGA 
GATGAACATTGCATCTGGCTGTCCCCGCTTTGTGGCTCATTCTGTTTTGGAGAATGCCAAGAACGCCTACATTAAAGAT 
GACACTCTGTTCTTGAAAGTGGCCGTGGACTTAACTGACCTGGAGGATCTCTA GTCACTGTTATGGGGTGATAAGAGGA 
CTTCTTGGGGCCAGAACTGTGGAGGAGAGCACATTTGATTATCATATTG 



The nucleic acid sequence of NOV 16a maps to chromosome lq32 and has 1376 of 
1379 bases (99%) identical to a Homo sapiens TRAF5 mRNA (gb:GENBANK- 
15 ID:AB000509|acc:AB000509.1) (E = 0.0). 

A NOV16a polypeptide (SEQ ID NO:38) encoded by SEQ ID NO:37 is 568 amino 
acid residues and is presented using the one letter code in Table 16B. Signal P, Psort and/or 
Hydropathy results predict that NOVl 6a does not contain a signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.4500. 



Table 16B. NOVl 6a protein sequence (SEQ ID NO:38) 



MAYSEEHKGMPCGFIRQNSGNSISLDPEPSIEYQFVERLEERYKCAFCHSVLHNPHQTGCGHRFCQHCILSLRELNTVPICPV 
DKEVIKSQEVFKI3NCCKREVLNLYVYCSNAPGCNAKVILGRYQQVPLACCYLLQDHLQQCLFQPVQCSNEKCREPVLRKDLKE 
HLSASCQFRKEKCLYCKKDVVVINLQNHEENLCPEYPVFCPNNCAKIIIjKTEVDEHLAVCPEAEQDCPFKHYGCAVTDKRRNIi 
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QQHEHSALREHMRLVLEKNVQLEEQISDLHKSLEQKESKIQQIAETIKKLEK^FKQFAQLFGKNGSFIjPNIQVFASHIDKSAW 
LEAQVHQLLQMTWQQQNKFDLRPLMEATO 

Klil WKVTDYKMKKREAVDGHTVS I FSQSFyTSRCGYRLCARAYI.NGDGSGRGSHLSLYFWMRGEFDSLLQWPFRQRVTLMLL. 
DQSGKKmMErFKPDPNSSSFKKPDGEMNIASGCPRFVAH^^ 

The NOV16a amino acid sequence has 557 of 568 amino acid residues (98%) identical 
to, and 557 of 568 amino acid residues (98%) similar to, a Homo sapiens 557 amino acid 
residue TRAF5 (TNF RECEPTOR ASSOCIATED FACTOR 5) (ptnr.SPTREMBL- 
5 ACC:O00463)(E=1.4e 307 ). 

NOV16a is expressed in at least the following tissues: Aorta, Brain, Cartilage, 
Chorionic Villus, Dennis, Lymphoid tissue, Pituitary Gland, Retina, Thymus, Tonsils and 
Uterus. This information was derived by determining the tissue sources of the sequences that 
were included in the invention including but not limited to SeqCalling sources, Public EST 
10 sources, Literature sources, and/or RACE sources. 



NO VI 6b 

A disclosed NOV16b nucleic acid of 1661 nucleotides (designated CuraGen Acc. No. 
CG561 08-02) encoding a novel TNF Receptor Associated Factor 5 (TRAF5)-like protein is 
shown in Table 16C. An open reading frame was identified beginning with an ATG initiation 
15 codon at nucleotides 7-9 and ending with a TAG codon at nucleotides 1585-1587. Putative 
untranslated regions upstream from the initiation codon and downstream from the termination 
codon are underlined in Table 16C, and the start and stop codons are in bold letters. 



Table 16C. NOV16b Nucleotide Sequence (SEQ ID NO:49) 



CCCACAATGGCTTATTCAGAAGAGCATAAAGGTATGCCCTGTGGTTTCATCCGCCAGAATTCCGGCAACTCCATTTCCT 
TGGACTTTGAGCCCAGTATAGAGTACCAGTTTGTGGAGCGGTTGGAAGAGCGCTACAAATGTGCCTTCTGCCACTCGGT 
GCTTCACAACCCCCACCAGACAGGATGTGGGCACCGCTTCTGCCAGCACTGCATCCTGTCCCTGAGAGAATTAAACACA 
GTGCCAATCTGCCCTGTAGATAAAGAGGTCATCAAATCTCAGGAGGTTTTTAAAGACAATTGTTGCAAAAGAGAAGTCC 
TCAACTTATATGTATATTGCAGCAATGCTCCTGGATGTAATGCCAAGGTTATTCTGGGCCGGrACCAGGATCACCTTCA 
GCAGTGCTTATTTCAACCTGTGCAGTGTTCTAATGAGAAGTGCCGGGAGCCAGTCCTACGGAAAGACCTGAAAGAGCAT 
TTGAGTGCATCCTGTCAGTTTCGAAAGGAAAAATGCCTTTATTGCAAAAAGGATGTGGTAGTCATCAATCTACAGAATC 
ATGAGGAAAACTTGTGTCCTGAATACCCAGTATTTTGTCCCAACAATTGTGCGAAGATTATTCTAAAAACTGAGGTAGA 
TGAACACCTGGCTGTATGTCCTGAAGCTGAGCAAGACTGTCCTTTTAAGCACTATGGCTGTGCTGTAACGATTTCTGAC 
TTACAGAAGAGCCTAGAACAGAAAGAAAGTAAAATCCAGCAGCTAGCAGAAACTATAAAGAAACTTGAAAAGGAGTTCA 
AGCAGTTTGCACAGTTGTTTGGCAAAAATGGAAGCTTCCTCCCAAACATCCAGGTTTTTGCCAGTCACATTGACAAGTC 
AGCTTGGCTAGAAGCTCAAGTGCATCAATTATTACAAATGGTTAACCAGCAACAAAATAAATTTGACCTGAGACCTTTG 
ATGGAAGCAGTTGATACAGTGAAACAGAAAATTACCCTGCTAGAAAACAATGATCAAAGATTAGCCGTTTTAGAAGAGG 
AAACTAACAAACATGATACCCACATTAATATTCATAAAGCACAGCTGAGTAAAAATGAAGAGCGATTTAAACTGCTGGA 
GGGTACTTGCTATAATGGAAAGCTCATTTGGAAGGTGACAGATTACAAGATGAAGAAGAGAGAGGCGGTGGATGGGCAC 
ACAGTGTCCATCTTCAGCCAGTCCTTCTACACCAGCCGCTGTGGCTACCGGCTCTGTGCTAGAGCATACCTGAATGGGG 
ATGGGTCAGGGAGGGGGTCACACCTGTCCCTATACTTTGTGGTCATGCGAGGAGAGTTTGACTCACTGTTGCAGTGGCC 
ATTCAGGCAGAGGGTGACCCTGATGCTTCTGGACCAGAGTGGCAAAAAGAACATTATGGAGACCTTCAAACCTGACCCC 
AATAGCAGCAGCTTTAAAAGACCTGATGGGGAGATGAACATTGCATCTGGCTGTCCCCGCTTTGTGGCTCATTCTGTTT 
TGGAGAATGCCAAGAACGCCTACATTAAAGATGACACTCTGTTCTTGAAAGTGGCCGTGGACTTAACTGACCTGGAGGA 
TCTCTAG TCACTGTTATGGGGTGATAAGAGGACTTCTTGGGGCCAGAACTGTGGAGGAGAGCACATTTGATTATCATAT 



The nucleic acid sequence of NOV16b maps to chromosome lq32 and has 1013 of 
20 1 05 1 bases (96%) identical to a Homo sapiens TRAF5 mKNA (gb:GENBANK- 
ID:AB000509|acc:AB000509.1) (E - 0.0). 
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A NOV1 6b polypeptide (SEQ ID NO:50) encoded by SEQ ID NO:49 is 526 amino 
acid residues and is presented using the one letter code in Table 16D. Signal P, Psort and/or 
Hydropathy results predict that I^OV16b contains a signal peptide and is likely to be localized 
to the cytoplasm with a certainty of 0. 4500. 



Table 16D. NOV16b protein sequence (SEQ ID NO:50) 



MAYSEEHKGMPCGPI RQNSGNS I SLDFEPS I EYQFVERLEERYKCAFCHS VIiHNPHQTGCGHRFCQHCI LSLRELNTVPI CPV 
DKEVIKSQEVFKDNCCKREVLNLYVYCSNAPGCNAKVILGRYQDHLQQCLFQPVQCSNEKCREPVLRKDLKEHLSASCQFRKE 
KCLYCKKDVWINLQNHEENLCPEYPVFCPNNCAKIILCT^ 

LxA ET I KKLE KE F KQ FAQL FG KNG S FLPN I QVFASH I DKS AWLE AQVHQLLQMVNQQQNKFDIiR PLMEAVDT VKQKI TLDENND 
QRLAVLEEETNKHDTHINIHKAQLSKWEERFKLLEGTCYNGKLIWKVTDYKMKKREAVDGHTVSIFSQSFYTSRCGYRLCARA 
YLNGDGSGRGSHLSLYFWMRGEFDSLLQWPFRQRVTLMLLDQSGKKNIMETFKPDPNSSSFKRPDGEMNIASGCPRFVAHSV 
LENAKNAYI KfeDTLFLKVAVPLTDLEDL 



The NOV16b amino acid sequence has 326 of 421 amino acid residues (77%) identical 
to, and 348 of 421 amino acid residues (82%) similar to, a Homo sapiens 557 amino acid 
residue TRAF'5 (TNF receptor associated factor 5) (ptnr:SPTREMBL-ACC: 000463) (E = 
7.2e" 162 ). 

Possible small nucleotide polymorphisms (SNPs) found f6r NOV16a are listed in 
Table 16E. 



Table 1 


6E: SNPs 


Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Change 


13375596 


1402 


A > G 


466 


Arg > Gly 



NOV 1 6a and NOV 1 6b are very closely homologous as is shown in the amino acid 
alignment in Table 16F. ■ , 

Table 16F Amino Acid Alignment of NOV16a and NOV16b 



NOV16b 
MOVl6b 



NOV16b 
N0V16b 



NOV16b 
NOV16b 



20 30 40 50 60 70 
.|....|....|....|....| |....|....|....|....| I 



MAYSEEHKGMPCX3FIRGi|SGfflS I SLDFEPS IE 

^yseehkgmpcgfir^sqIBsisldfepsieyqfveriiEerykc^ 




SLRELjjjTVPICPVDKEVIKSQEVFKDgGCKRE 

slrelBtvpicpvdkeviksqevfkdiScc: 



150 



170 180 



200 210 
• - I I I 



QQCfcFQPVQCSffiEKCREPVLRKDLKEHI^ASCQFRK^ 

qqclfqpvqcsBekcrep^rkdlkehlsascqfrkekclyckkdvwiiSlqSheeSlcpeyp^ 



NOV16b |V 
NOV16b ™ 



220 230 
■ |....|....|....| • . 



KI ILKTEVDEHLAVCPEAEQDCPFKHYGCAVT 
IILKTEVDEHLAVCPEAEQDCPFKHYGCA' 



240 250 260 270 280 

.|....|....|....|....|....|....J....| 
DKRRN LQQH EH S ALR EH MRLVLE KNVQLEEQ? 



290 
.|....|.. 



300 



NOV16b 
NOV16b 



SLEQKESKIQQIAETIKKLEKEFKQFAQLFG 
5 LEQKESKIQQLAETI KKLEKE FKQFAQLFG 



370 




330 



340 



m 

350 



390 
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NO VI 6b 
NOVlfib 



Novieb 



NOVl 6b 
NOVl 6b 



FDIjRPJjMEAVDTVKQICtTLLE^DQRId^^ 

FT)LRPI^EAVDTVKQKXrLLE^DQRZAVXEEETrSKHDrHlS IHKAQIiSkSeERFKLLEGTCySgKLIW 



S»3 



430 ,440 



450 460 
..|....|....|.. 



.470 480 ,490 



VTDYKMKKREATOGHTVSIFSQSFYTSRCGYRfc^^ 

YKMKKREAVPGHTVSIFSQSm'SRCGYRLCAl^YljSgDG 



QRVTLMJjLDQSG 
QRVTLMLLDQSG 



520 530 



540 



IMETFKPDP 5SSSFKRPDGEMBIASGCPRFVAHS VLEj 
IMETFKPDP SsSSFKRPDGEMSlRSGCPRFVAHSVLEi 



550 560 
Lr-J 



Novaeb 

NOVl 6b 



Homologies to any of the above NOV 16 proteins will be shared by the other NOV 16 
proteins insofar as they are homologous to each other as shown above. Any reference to • 
NOV16 is assumed to refer to both of the NOV16 proteins in general, unless otherwise noted. 

NOVl 6a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 16G. 



Table 16G. BLAST results for NOVl 6a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 


gi 1 11321603 fref )NP 


TNF receptor- 
associated 
factors [Homo . 
sapiens] 


557 


557/568 
(98%) 


557/568 
(98%) 


0.0 


004610 , 1 | 
(NM_0O4619) 


gi j 2138180 jgb}AAC51 


TNF receptor 

associated 
factors [Homo 
sapiens] 


538 


538/549 
(97%) 


538/549 
(97%) 


0.0 


329.1 | (U69108) 


gi | 1549146 ( dtrj J B7AA1 


TRAF5 [Mus 
musculus] 


558 


445/569 
(78%) 


497/569 
(87%) 


0 . 0 


1942.JLJJD83528) 


gi | 6755867 | ref |NP 0 


TNF receptor- 

associated 
factors [Mus 
musculus] 


558 


444/569 
(78%) 


496/569 
(87%) 


0.0 


35763 ,l| 
(NM_011633) 


gi | 14754079|ref |XP 


TNF receptor- 
associated 
factor3 [Homo 
sapiens) 


568 


232/562 
(41%) 


354/562 
(62%) 


e-17 


040913.1| 
(XM 040913) 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
10 in Table 16H. 

Table 16H ClustalW Analysis of NOVl 6a 

1) NOV 1 6a (SEQ ID NO:38) 

2) gi|ll321603(reflNP 004610.1| (NM_004619) TNF receptor-associated factor5 [Homo sapiens] (SEQ ID 
NO: 162) 

3) gj|2 1383 80lgblAA C5 1 329. \ \ (V69\ 08)TNF receptor associated factor5 [Homo sapiens] (SEQ ID NO: 1 63) 

4) gi,11549146ldbi|BAAn942.1| (D83528)T3ElAF5 [Mus musculus] (SEQ ID NO: 164) 

5) gil6755 867treflNP 035763.1) (NMJ)1 1633)TNF receptor-associated factor5 [Mus musculus] (SEQ ID 
NO: 165) 

6) ftiU4754079|reflXP 040913. 1| (XMJ)40913)TNF receptor-associated factor3 [Homo sapiens] (SEQ ID 
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NO: 166) 



NOV16 

gi|ll3216j03| 
gi|2138180| 
gi [1549146 [ 
gi (6755867 j 
gijl4754079 1 



NOV16 

gi|ll321603 | 
gi | 2138180 j 
gi|l549146 j 
gi|6755867 [ 
gi|l4754079| 



NOV16 

giill321603| 
gi 2138180] 
1549146 | 
6755867| 
14754079 | 



91 

gi 
gi 



NOV16 

gijll321603 j 
gil2138180 j 
gi (1549146 J 
gi [6755867 | 
gi|l4754079| 



NOV16 

gi|H321603 | 
gi(2138180| 
gi [1549146 | 
gi [6755867 j 
gi |14754079 j 



NOV16 

gi | 11321603 [ 
gi[2138180 j 
gi (1549146 j 
gi [6755867 j 
gi)l4754079] 



NOV16 

gi|H321603 [ 
gi(2138180| 
gi 1 1549146 j 
gi|6755867| 
gi [14754079| 



NOV16 

gi|H321603 | 
gi|2138180| 
gi j 1549146 j 
gi }6755867| 
gi}!4754079 | 



NOV16 




570 



580 
. . 1 . . 



VIKDDTLPLKVAVDIiTDIiED 
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gi 
gi 
gi 
gi 
gi 



11321603 1 
2138180 | 
1549146| 
6755867] 
14754079| 




Tables 161 - 16L list the domain description from' DOMAIN analysis results against 
NOV16a. This indicates that the N0V16a sequence has properties similar to those of other 
proteins known to contain these domains. ' 



Table 161 Domain Analysis of NOV16a 

gnl 1 pf am| pf amO 0917 , MATH, MATH domain. This motif has been called the 
Meprin And TRAF - Homol bgy (MATH) domain. This domain 'is hugely expanded 
in the nematode C. elegans. (SEQ ID NO: 167) 
Length = 116 residues, ,92.2% aligned 

Score = 72,8 bits (177), Expect 5e-l4 



NOV16 : 


436 


00917 : 


10 


NOV1 6 : 


496 


00917 j 


64 


NOV 16 : 


556 


00917 : 


110 



TVSIFSQSFYTSRCGYRLCARAYLNGDGSGRGSHLSLYFWMRGEFDSLLQWPFRQRVTL 4 95 

+1 II I I II III + +II II l+l II 

EGEEYYTSPVEERFGI PWRLRI YRNGG FLGLYLHCLKGEKDSNLKVJSIEAEFTL 63 

MLLDQSGKICNIMETFKPDPNSSSFKRPDGEMNI ASGCPRFVAHSVLENAKNAYI KDDTLF 555 

1+ +11 I II II + I++ II + 1+ llll 

KLVSDNGKSL TKKPK HVFEKPTGEGWG KFISWDDLE DDYLVDDTLI 109 



LKVAVDL 
+ + | + 
IEAEVKI 



562 
116 



Table 163 Domain Analysis of NOV16a 

gnl 1 Pf amlp f am02176 , zf-TRAF, TRAF- type zinc finger. (SEQ ID NO:168) 
Length = 57 residues, 100.0% aligned 

Score = 58.9 bits (141), Expect » 8e-10 



NOV16: 139 HLQQCLFQPVQCSNEKCREPVLRKDLKEHLSASCQFRKEKCLY CKKDWVINLQ 192 

I + I MM I II + +11 + 11 +1 1 1 II I I + II l + l ill 

02176: 1 HEKTCPFVPVPCPN-KCGKKILREDLPDHLSADCPKRPVPCPFKVYGCKVDMVRENLQ 57 



Table 16K Domain Analysis of NOV16a 

ghl | Smart j smart00061 , MATH , meprin and TRAF homology (SEQ ID NO: 16 9) 
Length = 100 residues, 100.0% aligned 

Score = 47.4 bits (111), Expect = 2e-06 



10 



NOV16 : 
00061 :, 
NOV1 6 : 
00061 : 
NOV16: 
00061 : 



419 WKVTDYKMKKREAVDGHTVSIFSQSFYTSRCGYRLCARAYLNGDGSGRGSHLSLYFWMR 4 78 

+1 + llll +11 + Mil 

1 VLSHTFKNVSKF EEGESYFSPSEEHFNI PWRLKI Y- - - RKNGFLSLYLHCEK 4 9 

4 79 GEFDSLLQWPFRQRVTLMLLDQSGKKNIMETFKPDPNSSSFKRPDGEMNIASGCPRFVAH 53 8 

I II +1 II 1+ l+ll ++ I++I | I +I++ 

50 EENDSR- KWSIEAEFTLKLVSQNGKS LSKKDTHVFEKPGG WGFSKFISW 97 

539 SVL 541 

I 

98 DDL 100 
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( t Table X6L Domain Analysis of NOV1 6a 

gnl | Smart | smart00l84 , RING, Ring finger; E3 ubi qui tin- protein ligase 
activity is 1 intrinsic i to the RING domain of c-Cbl and is liWely to be 
a general function of this domain,/ Various RING fingers exhibit 
binding activity towards E2 ubiquit in- conjugating enzymes (UbC s) 
(SEQ ID NO: 170) 

Length =41 residues, 97.6% aligned 

Score = 38.5 bits (88 ) , Expect « 0.001 

NOV16: 4 5 CAFCHS VL- HNPHQTGCGHRFCQHCI LSLRELNTVP I CPV 83 

II +1 .III ||+ I'l I + '|| + 

00184: 1 CPICLEEYhKDPWIiPeGHTFCRSCIRKWLESSNSNTCPI 40 

Tumor necrosis factor (TNF; 191 160) receptor-associated factors (TRAFs) are signal 

i • 

transducers for members of the TNF receptor superfamily (see 191 190). TRAP proteins are 
composed of an N-terminal cysteine/histidine-rich region containing zinc RING and/or zinc 
finger motifs, a coiled coil (leucine zipper) motif, and a homologous region in the C terminus 
that defines the TRAF family, the TRAF domain. The TRAF domain is involved in self- 
association and receptor binding. By degenerative oligonucleotide PGR amplification, Nakano 
et al. (1996) identified TRAF5 in the mouse and showed that its specifically interacts with the 
lymphotoxin-beta receptor (600979) and activates the transcription factor NF-kappa-B (see 
1 6401 1). Nakano et al. (1997) cloned the human TRAF homolog by cross hybridization with 
mouse TRAF5 cDNA. Their human cDNA of 2,894 bp has a 557-amino acid open reading 
frame that exhibits 77.5 and 80% identity to mouse TRAFS at the nucleotide and amino acid 
levels, respectively. Northern blot analysis revealed that human TRAFS mRNA is expressed in 
all visceral organs. Western blotting revealed that the human protein is abundantly expressed 
in a human follicular dendritic cell line, and to a lesser degree in several tumor cell lines. 
Interspecific backcross mapping showed that TrafS is located in the distal region of mouse 
chromosome 1, which shares homology with human lq. Fluorescence in situ hybridization 
confirmed the regional localization of human TRAFS to chromosome lq32. To investigate the 
functional role of Traf5 in vivo, Nakano et al. (1999) generated TrafS-deficient mice by gene 
targeting. They found that Traf5 -/- B lymphocytes show defects in proliferation and 
upregulation of various surface molecules, including CD23 (1 SI 445), CD54 (147840), CD80 
(1 12203), CD86 (601020), and FAS (134637) in response to CD40 (109535) stimulation. 
Moreover, in vitro Ig production by Traf5 -/- T lymphocytes stimulated with anti-CD40 plus 
IL4 (347780) was reduced substantially. CD27-mediated costimulatory signal also was 
impaired in Traf5 T lymphocytes. Collectively, these results demonstrated that TrafS is 
involved in CD40- and CD27-mediated signaling. 
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A cDNA encoding the human homolog of the tumor necrosis factor receptor-associated 

factor 5 (TRAF5) protein has been molecularly cloned from a cDNA library of Human Daudi 

B cell line. The sequence analysis revealed tjipt the cDNA encoded a protein of 557 aa 

residues with a calculated molecular weight of 64,236. The encoded protein has typical 

structural characteristics shown in the TRAF family of proteins and binds to the cytoplasmic 

region of lymphotoxin-beta receptor more efficiently than to that of CD40 and GD30. The 

TRAF5 gene was mapped to the 'human chromosome lq32.3-q4Ll. Overexpression of human 

TRAF5 activates NF kappa B transcription factor in human 293T kidney cells. These results 
i 

suggest that the human TRAF 5 protein could be involved in the signal triggered by various 
members of the tumor necrosis factor receptor (TNFR) superfamily including CD40, CD30 
and lymphotoxin-beta receptor (Mizushima S, FujitaM, Ishida T, Azuma S, Kato K, Hirai M, 
Otsuka M, Yamamoto T, Inoue J Cloning and characterization of a cDNA encoding the human 
homolog of tumor necrosis factor receptor- associated factor 5 (TRAF5). PMID: 951 1754 
Gene 1998 Jan 3P;207(2): 135-40) 

Members of tumor necrosis factor receptor (TNFR) family signal largely through 
interactions with death domain proteins and TRAF proteins. Here we report the identification 
of a novel TNFR family member ATAR. Human and mouse ATAR contain 283 and 276 
amino acids, respectively, making them the shortest known members of the TNFR 
superfamily. The receptor is expressed mainly in spleen, thymus, bone marrow, lung, and 
small intestine. The intracellular domains of human and mouse ATAR share only 25% 
identity, yet both interact with TRAF5 and TRAF2. This TRAF interaction domain resides at 
the C-terminal 20 amino acids. Like most other TRAF-interacting receptors, overexpression of 
ATAR activates the transcription factor NF-kappaB. Co-expression of ATAR with TRAF5, 
but not TRAF2, results in synergistic activation of NF-kappaB, suggesting potentially different 
roles for TRAF2 and TRAF5 in post-receptor signaling (Hsu H, Solovyev I, Colombero A, 
Elliott R, Kelley M, Boyle WJ ATAR, a novel tumor necrosis factor receptor family member, 
signals through TRAF2 and TRAF5. PMID: 9153189 J Biol Chem 1997 May 
23;272(21):13471-4). 

Tumor necrosis factor (TNF) receptor-associated factors (TRAFs) are signal 
transducers for members of the TNF receptor superfamily. We previously identified murine 
TRAF5 (mTRAF5) and showed that it specifically interacts with the lymphotoxin-beta 
receptor (LT-beta R) and activates the transcription factor NF-kappa B. Here we have cloned 
the human TRAF5 homologue (hTRAF5) by cross hybridization with mTRAF5 cDNA. 
hTRAF5 cDNA is composed of 2894 nucleotides with a 557-amino-acid open reading frame 
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that exhibits 77.5 and 80% identity to mTRAF5 at the nucleotide and amino acid levels, 
respectively. Northern blot analysis revealed that hTRAF5 niRNA is expressed in all visceral • 
organs. Western blotting revealed that hTRAFS protein was abundantly expressed in the 
human follicular dentritic cell line, FDC-1, and to a much lesser degree in several tumor cell 
5 lines. Interspecific backcross mapping revealed that Traf5 is located in the distal region of 
mouse chromosome 1, which shares a region of homology with human chromosome Iq. 
Fluorescence in situ hybridization confirmed regional localization to human chromosome 
lq32 (Nakano H, Shindo M, Yamada K, Yoshida MC, Santee SM, Ware CF, Jenkins NA, 
Gilbert DJ, Yagita H, Copeland ftC, Okumura K Human TNF receptor- associated factor 5 

1 0 (TRAF5): cDNA cloning, expression and assignment of the TRAF5 gene to chromosome 
lq32. PMID: 9177772 Genomics 1997 May 15;42(l):26-32). 

The above defined information for NOV16 suggests that this NOV16 protein may 
function as a member of a TNFR protein family. Therefore, the NOV 16 nucleic acids and 
proteins of the invention are useful in potential therapeutic and diagnostic applications. For 

1 5 example, a cDNA encoding the NOV1 6 protein may be usefiil in gene therapy, and the 

NOV1 6 protein may be useful when administered to a subject in need thereof By way of 
nonlimiting example, the compositions of the present invention will have efficacy for 
treatment of patients suffering from diseases or disorders involving signal transduction and 
members of the TNF receptor superfamily, as well as, CR1 deficiency; Cardiomyopathy, 

20 familial hypertrophic, 2; Cardiomyopathy, familial, dilated-2 ; Ectodermal dysplasi a/skin 
fragility syndrome; Epidermolysis bullosa, Herlitz junctional type; Epidermolysis bullosa, 
generalized atrophic benign; Factor H deficiency; Glomerulopathy, fibronectin; Hemolytic- 
uremic syndrome; Hemolytic-uremic syndrome; Membroproliferative glomerulonephritis; 
Nephropathy, chronic hypocomplementemic; Popliteal pterygium syndrome; Severe combined 

25 immunodeficiency due to PTPRC deficiency; Chitotriosidase deficiency; Flyperproreninemia; 
van der Woude syndrome; SLE susceptibility; Malignant hyperthermia susceptibility 5; 
Measles, susceptibility to; Multiple sclerosis, susceptibility to; AIDS, cancer, immunological 
disorders and diseases; viral and bacterial susceptibility and immunity and/or septic shock. 
The NOV 16 nucleic acid encoding TNFR-like protein, and the TNFR-like protein of the 

30 invention, or fragments thereof, may further be useful in diagnostic applications, wherein the 
presence or amount of the nucleic acid or the protein are to be assessed. 

NO VI 7 
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A disclosed NOV 17 nucleic acid of 622 nucleotides (designated CuraGen Acc; No. 
CG56101-01) encoding a novel Ferritin light chain-like protein is shown in Table 17A. An 
open reading frame was identified beginning with an ATG initiation codon 'at nucleotides 18- 
20 and ending with a TAG codon at nucleotides 603-605. Putative untranslated regions 
5 upstream from the initiation codon and downstream from the termination codon are underlined 
in Table 1 7A. and the start and stop codons are in bold letters. 

Table 17A. NOVl 7 Nucleotide Sequence (SEQ ID NO:39) \ 

" CTCCTTTCCAA AGAACCATGAGTTCCCACW 
CCACCGGC^GCTGCGGGCTTCCCTTACCTACCTCTCTCTCATCCTCCATTTCTACCGCGACGACGTGACCCTGGAGGGC 
ATGGG C CACTT C CGAG AGCTGGC CC AGG AGAAGCG ACAGGGCG CCC AG AGTC TGTGG AAG ACGC AAAACCAG CG CGG AG 
CCCTCTGCGATGCCATCCAGAAGCCGTCCTGGGATGAAAAGGACAGCAGTTTGGGCGCCCTGCGAGCCGCGTTGGCCCT 
GGAGACGAACCTGAACCAGGCCCTGCTGGATCTGCACGCCCTGGGCGCAAAGCATGCAGACTCTCACCCCTGCGGCTTC 
CTGGAGAACCACTTCCGGCCACATCCCTCTGTCAGACCTGGGAAAGCGTCCACCCGAGCTGCTCCCTTCAACCTCAAGA 
TACATTTTTTTTCTTTCTTTCTTTTTGAAAGAGTCTCCCTGCGTGTAGACCCCTGGACTATTGATTGCACCACATTCAT 
TCCTTCCCCAGCTCACTACTCCAACAAGGTACCAAATAT A CCAAATATiTTAGAGA AT TAGGATGAAerA 

The nucleic acid sequence of NOVl 7 maps to chromosome 4 and has 191 of 246 bases 
(77%) identical to a Oiyctolagus cuniculus ferritin light chain subunit rnRNA 
10 (gb:GENBANK-ID;OCFERL|acc:X07830.1) (E - 6.2e" 25 ). 

A NOVl 7 polypeptide (SEQ ID NO:40) encoded by SEQ ID NO:39 is 201 amino acid 
residues and is presented using the one letter code in Table 17B. Signal P, Psort and/or 
Hydropathy results predict that NOVl 7 does n,ot coritain a signal peptide and is likely to be 
localized in the cytoplasm with a certainty of 0.4500. 

Table 17B. NOVl 7 protein sequence (SEQ ID NO:40) 

MSSHISQNYCT^ 

WDEKDSSLGALRAALALETNLNQALLDLHALGAKHADSHPCGFLENHFRPHPSVRPGKASTRAAPFNLKIHFFSFFLFERVSL 
RVDPVJTIDCTTFIPSPAHYSNKVPmPNIXRIRMN 

15 : 1 

The NOV17 amino acid sequence has 95 of 175 amino acid residues (54%) identical 

to, and 1 13 of 175 amino acid residues (64%) similar to, a Oiyctolagus cuniculus 175 amino 

acid residue ferritin light chain protein (prnr:pir-id:S01239) (E = 1.8e" 37 ). 

NOV 17 is expressed in at least the following tissues: colon tunior. This information 

20 was derived by determining the tissue sources of the sequences that were included in the 
invention including but not limited to SeqCalling sources, Public EST sources, Literature 
sources, and/or RACE sources and the expression pattern of a closely related Oryctolagus 
cuniculus ferritin light chain subunit mRNA homolog (gb:GENBANK- 
ID:OCFERL|acc:X07830.1) 

25 NOVl 7 also has homology to the amino acid sequences shown in the BLASTP data 

listed in Table 17C. 
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Table 17C. BLAST results for NO VI 7 



Gene Index/ 
Identifier 



Protein/ Organism 



Length 
Caa) 



Identity 
(%>. 



Positives 
(%) 



Expect 



gi | 60160 50 | sp|Q4641 
5 | FRIIj BOVIN 



FERRITIN LIGHT 
CHAIN (FERRITIN L 
SUBUNIT) [Bos 
taurus] 



175 



82/13-4 
(61%) 



92/134 
(68%) 



2e-35 



gi 1 7739645 |gbf AAF68 
948,l|AF230928 1 
(AF230928) 



ferritin light 
chain [Cavia 
porcellus} 



175 



81/134 
(60%) 



94/134 
(69%) 



6e-35 



gi | 204131 |gb|AAA4ll 
54 .11 (K01930) 



ferritin light 
chain subunit 
[Rattus 
norvegicus] 



183 



81/134 
(60%) 



93/134 
(68%) 



2e-34 



gi[H6974l|sp|P0279 
ljFRIL HORSE 



FERRITIN LIGHT 
CHAIN (FERRITIN L 
SUBUNIT) [Equus 
caballus] 



175 



81/134 
(60%) 



94/134 
(69%) 



2e-34 



gi 1 6679873 | ref j NP 0 
32075. 1| 
(NM 008049) 



ferritin light 
chain 2 [Mus 
musculus] 



183 



80/134 
(59%) 



92/134 
(67%) 



5e-34 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 17D. 

Table 17D ClustalW Analysis of NOV17 

1) NOV17(SEQIDNO:40) 

2) gi|6016050lsplO46415[FRJL BOVIN FERRITIN LIGHT CHAIN (FERRITIN L SUBUNIT) [Bos taurus] 
(SEQ ID NO: 171) 

3) gi |7739645 1ebiA AF68948.1|AF230928_l (AF230928) ferritin light chain [Cavia porcellus] (SEQ ID 
NO: 172) 

4) gil 2041311gbiAAA41154.il (K01930) ferritin light chain subunit [Rattus norvegicus] (SEQ ID NO:173) 

5) gi ll 169743 |splP02791|FRIL_HORSE FERRITIN LIGHT CHAIN (FERRITIN L SUBUNIT) [Equus 
caballus] (SEQ ID NO: 174) 

6) gil6679873lreflIW_032075.il (NM_008049) ferritin light chain 2 [Mus musculus] (SEQ ID NO: 175) 



NOV17 

gi|6016050) 
gi |7739645 | 
gi|204131 | 
Cfi|ll69741 | 
gi|6679873 j 



NOV17 

gi j 6016050 ( 
gi | 7739645 j 
gi | 204131 | 
gijll69741 | 
gij 6679873 j 





W__KDSg _GgLR| 

S iE S2 ,T 




NOV17 

gi | 6016050 j 
gi [7739645 j 
gij 204131 | 
gi 111697411 
gi |6679873 j 




NOV17 RIRMN 
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gi 
gi 



6016050 I 
7739645] 
204131| 
1169741) 
6679873 J 



Table 17E lists the domain description from DOMAIN analysis results against 
NOV17. This indicates that the NOV17 sequence has properties similar to those of other 
proteins known to contain these domains. 1 



Table 17E. Domain Analysis of NO VI 7 

gnl [Pf am) pf am002l0 , ferritin, Ferritin. {SEQ ID N0:176) 
Length = 155 residues , 7*8 . 1% aligned 

Score ~ 104 bits (260), Expect = 4e-24 



NOV17: 14 EAAVSSLVHRQLRASLTYLSLILHFYRDDVTLEGMGHF-RELAQEKRQGAQSLWKTQNQR 72 

III- ++ +\ II III- +! Illl II III* 1-1+ l + I I IM 

00210: 2 EAALNRQINLiELYASYVyLSMAAYFDRDDVALPGFAKFFREASHEEREHAEKLMKYQNKR 61 
NOV17: 73 GALCD^-AIQKPSWDEKDSSLGALRAALALETNLNQALLDLHAIjGAiaiADSHPCGFLEN^ 13 0 

I Ml II I I k+ II +. +111 III- 

00210: 62 GGRWLQDIKKPEKDEWGSGLEAMQTALQLEKSWQSLbELHKVATDNNDPHLCDFLESE 121 

NOV17: 131 F 131 

+ 1 
00210; 122 Y 122 



Ferritin is one of the major non-hem e iron storage proteins. It consists of a mineral 
core of hydrated ferric oxide, and a multi-subunit protein shell which englobes the former and 
assures its solubility in an aqueous environment. 

10 In animals the protein is mainly cytoplasmic and there are generally two or more genes 

that encodes for closely related subunits (in mammals there are two subunits which are known 
as H(eavy) and L(ight)). In plants ferritin is found in the chloroplast. 

There are a number of well conserved region in the sequence of ferritins. We have 
selected two of these regions to develop signature patterns. The first pattern is located in the 

1 5 central part of the sequence of ferritin and it contains three conserved glutamate which are 

thought to be involved in the binding of iron. The second pattern is located in the C-terminal 
section, it corresponds to a region which forms a hydrophilic channel through which small 
molecules and ions can gain access to the central cavity of the molecule; this pattern also 
includes conserved acidic residues which are potential metal binding sites, 

20 Ferritin is the major intracellular iron storage protein in all organisms. It has the shape 

of a hollow sphere that permits entry of a variable amount of iron for storage as ferric 
hydroxide phosphate complexes. Mammalian liver and spleen ferritin (relative mass about 
450,000) consists of 24 subunits of 2 species, the heavy subunit (relative mass = 21,000) and 
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the light subunit (relative mass = 19,000). Brown et al. (1983) presented (evidence that, in the 
rat, the 2 species of subunits are coded by separate mRNAs. Furthermore, a family of genes 
appear to encode the light subunit. 'Studies of ferritin synthesis in cell-free systems suggested 
that the H and L subunits have different mRNA molecules (Watanabe and Drysdale, 1981). By 
5 study of human/Chinese hamster hybrid cells and use of a radioimmunoassay specific for 

human ferritin, Caskey et al. (1983) showed that chromosome 19 encodes the structural gene 
for ferritin. Thus, mutation in the structural gene for ferritin is not responsible for 
hemochromatosis , since that disorder is coded by chromosome 6. By in situ hybridization, 
McGill et al. (1984) confirmed the assignment of the light chain gene to chromosome 19 but 

0 concluded that the heavy chain is encoded by lp. By study of hamster-human and mouse- 

i 

human hybrid cells, some with translocations involving chromosome 19, Worwood et al. 
(1985) concluded that light subunits of ferritin (rich in human spleen ferritin) are coded by a 
gene in segment 19ql3.3-qter and that the gene for the heavy subunit (rich in human heart 
ferritin) is located on chromosome 1 1. By miniaturized restriction Enzyme analysis of sorted 

.5 chromosomes, Lebo et al. (1985) demonstrated ferritin light-chain genes on at least 3 

chromosomes. Munro et al. (1988) reviewed information on the ferritin genes. They pointed 
out that in both the rat and the human, several ferritin pseudogenes can be recognized not only 
because they are flanked by 5-prime and 3-prime direct repeats representing the site of their 
retroinsertion into the chromatin, but also because they differ from functional genes by the 

:0 absence of introns and by the presence of polyadenylic acid tails that have been inserted onto 
the 3-prime end of the messenger transcription of the functional gene. They cited the evidence 
of Santoro et al. (1986) and of Hentze et al. (1986) that there is only one expressed H and one 
expressed L gene in the human genome. 

The above defined information for NOV17 suggests that this NOV17 protein may 

'5 function as a member of a Ferritin light chain protein family. Therefore, the NOV17 nucleic 
acids and proteins of the invention are useful in potential therapeutic and diagnostic 
applications. For example, a cDNA encoding the NOV 17 protein may be useful in gene 
therapy, and the NOV17 protein may be useful when administered to a subject in need thereof. 
By way of nonlimiting example, the compositions of the present invention will have efficacy 

'0 for treatment of patients suffering from hyperferritinemia and cataract and/or Cancer. The 

NOV17 nucleic acid encoding Ferritin light chain-like protein, and the Ferritin light chain-like 
protein of the invention, or fragments thereof, may further be useful in diagnostic applications, 
wherein the presence or amount of the nucleic acid or the protein are to be assessed. 

NOV18 
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A disclosed NOV1 8 nucleic acid of 781 nucleotides (designated CuraGen Acc. No. 
CG56095-01) encoding a novel Neurotrophin-6 alpha-like protein is shown in Table 18 A. An 
open reading frame was identified beginning, with an AGC codon at nucleotides 67-69 and 
ending with a TGA codon at nucleotides 778-780. Putative untranslated regions upstream 
5 from the initiation codon and downstream from the termination codon are underlined in Table 
1 8A, and the start and stop codons are in bold letters. 

Table 18A. NOV] 8 Nucleotide Sequence (SEQ ID NO:41) 

atttactattaatctacaagttgggtgttatgcfiagtcctatatatggagtcccccaaacttctag agcaagggcttcc 
ccataatccItggcaggcaggCctcccctggggttcccaacttctgaccccactgaagtgtttatcctcttctctaatcc 
cagcctccttttccctgtctccatgtgctc^gagaggtgctctgagagatgctcgcgctcccccagactccctctacat 
ccccctcattttcttcctctccagtgtgtcaatggggtcctaaccccaccctcgacattgtcgccttttcctgatccaa 
agtgggaccttcttttcccccgagtggtcctgcctaggggtgccgctgccgggccccctctggtcttcctgctggagac 
tgggg cdtttcgggagtcag c agg cgc c cggg cc aaccg gag ccag cg agg g gtg agcaat acttcactgg cgagtc at 
cagggtgagctggccgtgtgtgatgcagtcactggctgggtgacagacccccggaccgctgtggactcaggtgtgctgg 

AGGTGGAGGTGTTGGGCGAGGTGCCTGCAGCTGGCGGCAGTTCCCTCTGCCAACACTTCTTTGTCACCTGCTTCGAGGC 
CAATAACTCTGAAGAAGGTGGCCCAGGGGTAGGTGGAGGGGCTGCCGCAGGGGTGTGGACCGdGGGGCACTGGGTGTCT 
GAGTGCAAGGCGAAGCAGTCCTATGTGCGGGCATTGACCGCTQATGCCCAGGGCTGTGTGGACTGGTGAT 

The nucleic acid sequence of NOV18 maps to clrromosome 19 and has 740 of 781 
bases (94%) identical to a Homo sapiens ]s[T6-alpha-acidic neurotrophin 6 alpha mRNA 
10 (gb:GENBANK-ID:S41522|acc:S41522.1) (E= 8.6e 154 ). 

A NOV1 8 polypeptide (SEQ ID NO:42) encoded by SEQ ID NO:41 is 237 amino acid 
residues and is presented using the one letter code in Table 1 8B. 

Table 18B. NOV18 protein sequence (SEQ ID NO:42) 

SKGFPIILAGRPPLGFPTSDPTEVFIIiFSNPSLLFPVSMCSERCSERCSRSPRLPLHPPHFLPLQCVNGVLTPPSTIjSPFPDP 
KWDLLFPRWLPRGAAAGPPIiVFLLETGAFRESAGARANRSQRGVSNTSLASHQGELAVCDAVTGWVTDPRTAVDSGVLEVBV 
LGEVPAAGGSSLCQHFFVTCFEANKSEEGGPGVGGGAAAGVMTGGHWVSECKAKQSYVRALTADAQGCVDW 

The NOV1 8 amino acid sequence has 215 of 237 amino acid residues (90%) identical 
1 5 to, and 220 of 237 amino acid residues (92%) similar to, a Homo sapiens 257 amino acid 
residue neurotrophin-6 alpha protein (NT-6 alpha) (pmr:SWISSPROT-ACC:P34132) (E = 
3.1e 113 ). 

NOV18 is expressed in at least the following tissues: Placenta, Uterus and Whole 
Organism. This information was derived by determining the tissue sources of the sequences 
20 that were included in the invention including but not limited to SeqCalling sources, Public 

EST sources, Literature sources, and/or RACE sources and because of the expression pattern 
of a closely related Homo sapiens NT6- alpha- acidic neurotrophin 6 alpha mRNA homolog 
(gb:GENBANK-ID:S41522|acc:S41522.1). 

Possible small nucleotide polymorphisms (SNPs) found for NOV1 8 are listed in Table 

25 18C. 

Table 18C:" SNPs 
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Variant 


Nucleotide 
Position 


Base 
Change 


Amino 
Acid 
Position 


Base 
Change 


13376701 


314 


C>T 


84 


Pro > Leu 


13375249 


322 


G> A 


. 86 


Asp > Asn 


13376702 


377 


T>C ' 


104 


Leu > Pro 



NOV18 also has homology to the amino acid sequences shown in the BLASTP data 



listed in Table 18D. 



Table 18D. BLAST results for NOV18 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect, 


gi|462743 j sp|P34132 


NEUROTROPHIN-6 
ALPHA (NT- 6 
ALPHA) [Homo 
sapiens] 


257 


160/237 
(67%) 


165/237 
(69%) 


le-74 


jNT6A HUMAN 


gi |462744 | sp|P34133 


NEUROTROPHIN-6 
BETA (NT- 6 BETA) 
[Homo sapiens] 


257 


155/237 
(65%) 


159/237 
(66%) 


4e-71 


| NT6B HUMAN 


gi \ 17482866 |ref |XP 


similar to 
NEUROTROPHIC 6 
BETA (NT- 6 BETA) 
[Homo sapiens] 


210 


116/172 
(67%) 


i 120/172 
(69%) 


2e-52 


064951 .1 | 
(XM_064951) 


gi [462745 { spf P34134 


NEUROTROPHIC 6 
GAMMA (NT- 6 
GAMMA) [Homo 
sapiens] 


186 


112/165 
(67%) 


119/165 
(71%) 


4e-49 


j NT6G HUMAN 


gi J 5453808 |ref | NP 0 


neurotrophin 5 
preproprotein ; 
neurotrophic 
factor 5 ; 
neurotrophin 4 ; 

neurotrophic 
factor 4 [Homo 
sapiens] 


210 


99/165 
(60%) 


109/165 
(66%) 


9e-40 


06170 .1 | 
(NM 006179) 



5 The homology of these sequences is shown graphically in the ClustalW analysis shown 

in Table 1 8E. 



Table 18E ClustalW Analysis of NO VI 8 

1) NOV18(SEQIDNO:42) 

2) gi!462743 ls nlP34 1 32INT6A HUMAN NEUROTROPHIN-6 ALPHA (NT-6 ALPHA) [Homo sapiens] (SEQ 
ID NO: 177) 

3) eil462744lsp iP34133INT6B_HUMAN NEUROTROPHIN-6 BETA (NT-6 BETA) [Homo sapiens] (SEQ ID 
NO;178) 

4) gi]174S2866ir ef)XP 064951.11 (XM_064951) similar to NEUROTROPHIN-6 BETA (NT-6 BETA) [Homo 
sapiens] (SEQ ID NO: 179) 

5) gil462745)sp|P34134lNT6G HUMAN NEUROTROPHIN-6 GAMMA (NT-6 GAMMA) [Homo sapiens] 
(SEQ ID NO: 180) 

6) gi!5453808 |reflNP _ 006 170JJ (NMJ)06179) neurotrophin 5 preproprotein; neurotrophic factor 5; 
neurotrophin 4; neurotrophic factor 4 [Homo sapiens] (SEQ ID NO: 181) 

10 20 30 40 50 60 70 

....|....|.... )....)... .|....|. } ].... j....) j.. i •■..li-.^l 

gi | 462743 j SKGFPIIlAGRPPLGFPTSDPTEVFILFSNPS^F^SVCSERCSERCSHffip^PHPgHgPgPC^N 
gi j 4 62 744 | SKGFPIIJ^GRPPI^FPTSDPTEVFIFFPNPS^FgVSMCSERCSERCSC^r^BpHpSHgpgpcSl 
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gi}l7482866 } 

gi|462745| 

gi|5453808| 



NOV18 
gi|462743 | 
gi I 462744 | 
gi |1748286G | 
gij462745| 
gi j&4S3808i 



NOV18 
gi|462743| 
gi{462744| 
gi | 17482866 | 
gi (462745) 
'gi 5453808] 



NOV18 

gi | 462743 | 

gi | 462744 j 

gi | 17482866 | 

gi|462745l 

gi j 5453808 | 



MjJjL gQAgSASgsgssSPSjlgE 

MjJpLg gcsEQlLbLFLLgs VPIE - S 

100 110 120 ■ 130 140 



210 




Tables 18F and 18G list the domain description from'DOMAIN analysis results against 
NOV18. This indicates that the N0V18 sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 18F. Domain Analysis of NO VI 8 

gnl | Pfam|pfam00243 , NGF, Nerve growth factor family. (SEQ ID NO: 182) 
Length = 117 residues, 81.2% aligned 

Score = 70.9 bits (172), Expect - 8_e-14 



NOV18 : 



00243: 



13 4 ASHQGELAVCDAVTGWVTDPRTAVDSGVLEVEVLGEVPAAGGSSLCQHFFVT- CFEANNS 192 

ii +IM+IM+I+ mi mi + + i i+ii i i 

3 ASRRGELS VCDSVSVWVTDKTTAVD - 1 RGKEVTVLGEVNTNNGP1»KQYFFETKCKPPGPV 6 1 



NOV18: 193 EEGGPGVGGGAAAGWTGGHWVSECKAKQSYVRALTADAQGCVDW 237 

ii mi i+miii n i i 

00243 : 62 GS GCRGIDKRHWNSECKTTQTYVRALTMDANKRVGW 97 



Table 18G. Domain Analysis of NO VI 8 

gnl | Smart ] smart00140 , NGF , Nerve growth factor (NGF or beta-NGF) ; NGF 
is important for the development and maintenance of the sympathetic 
and sensory nervous systems. (SEQ ID NO: 183) 
Length = 106 residues, 88.7% aligned 

Score - 68.2 bits (165), Expect ~ 5e~13 



NOV18: 135 SHOGELAVCDAVTGWVTDPRTAVX»SGVLEVEVLGEVPAAGGSSLCQHFFVT-CFEANNSE 193 

ii+ii +MI+I+ mi ii i + + i i+ii i i i + 

0014 0: 1 SHRGEYSVCDSVSVWTDKTTATD-ISGKEVTVLGEVPVKNGPIiKQYFFETRCKSPNPVK 59 
NOV18: 194 EGGPGVGGGAAAGVWTGGHWVSECKAKQSYVRALTADAQGCVDW 237 

II I I l+IIIIIMI I I 

00140: 60 S GCRGIDSRHWNSHCTTTQTYVRALTSDANQRVGW 94 
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The neurotrophics, a family of mammalian neuronal survival and differentiation 
factors, are basic proteins that are proteolytically cleaved at a dibasic cleavage site to yield a 
mature protein with 6 invariant cysteine residues. Berkemeier et ah identified human genomic 
5 clones corresponding to neurotrophic 5 (NTS; and 3 related genes that they designated NT6- 
alpha, -beta, and -gamma They also isolated a partial NT6-'aIpha cDNA. The NTF6-alpha open 
reading frame encodes a predicted 255-amino acid protein that lacks an initiation codon. . 
Although the NT6-alpha DNA sequence is more than 90% identical to that of NT5, homology 
between the 2 proteins begins at amino acid 74 of NT6-alpha because the 2 proteins utilize 

10 different reading frames. The C-terminal putative mature half of NT6-alpha shares 75% and 
45% identity to those of NT5 and nerve growth factor respectively. Berkemeier et al. (1992) 
suggested that NT6-alpha may be a pseudogene because they were' unable to identify an 
initiation codon, and because the .deduced protein is acidic and lacks the dibasic cleavage site 
and 2 of the 6 conserved cysteines that are characteristic of neurotrophic. Independently, Ip et 

15 al. (1992) cloned genomic segments of the same gene, which they considered to be an NT4 
pseudogene. By analysis of somatic cell hybrids, Berkemeier et al. (1992) mapped the NT5 
and NT6 genes to human chromosome 19. Using Southern blot analysis, they detennined that 
NT6-related genes are present in monkey and cow, but not in rodents. By FISH, Ip et al. 
(1992) mapped both the NT4 and the NT4 pseudogene to 19ql3.3, suggesting that the 

20 pseudogene arose via a gene duplication event. 

The above defined information for NOV18 suggests that this NOV18 protein may 
function as a member of a Neurotrophin-6 alpha protein family. Therefore, the NOV1 8 nucleic 
acids and proteins of the invention are useful in potential therapeutic and diagnostic 
applications. For example, a cDNA encoding the NOV18 protein may be useful in gene 

25 therapy, and the NOV1 8 protein may be useful when administered to a subject in need thereof. 
By way of nonlimiting example, the compositions of the present invention will have efficacy 
for treatment of patients suffering from Fertility, Endometriosis, bone metabolism disorder; 
proinflammatory disorder; immune disorder; inflammatory disease; septic shock; stroke; 
diabetes; arthritis; intercolitis; pneumonitis; epithelial cell; skin disease; proliferative disorder; 

30 skin cancer; melanoma; Kaposi's sarcoma; epithelial cancer; squamous cell carcinoma; bone 
resorption disorder; osteoporosis; Paget's disease; osteoarthritis; degenerative arthritis; 
osteogenesis imperfecta; fibrous displasia; hypophosphatasia; bone sarcoma; myeloma bone 
disorder; osteolytic bone lesion; hypercalcemia; bone mass; bone fragility; bone pain; bone 
deformity and/or bone fracture. The NOV1 8 nucleic acid encoding Neurotrophin-6 alpha-like 
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protein, and the Neurotrophin-6 alpha-like protein of the invention, or fragments thereof, may 
farther be useful in diagnostic applications, wherein the presence or amount of the nucleic acid 
or the protein are to be assessed. ' 

NOV19 



NOV 19 includes three novel Methionin Aminopeptidase-like proteins disclosed below. 

The disclosed proteins have been named NOV 19a, NOV 1 9b and NOV 19c. 

i 1 

NOV19a 



A disclosed NOV19a nucleic acid of 1018 nucleotides (designated CuraGen Acc. No. 
C(j502 87-02) encoding a novel Methionin Aminopeptidase-like protein is shown in Table 
19A. An open reading frame was identified beginning with an TCA codon at nucleotides 56- 
58 and ending with a TGA codon at nucleotides 1001-1003. A putative untranslated region 
downstream from the termination codon is underlined in Table 19 A, and the start and stop 
codons are in bold letters. 



Table 19A. NOV19a Nucleotide Sequence (SEQ ID NO:43) 



GGCG CCC AGCGGCTTGCACCTGTTCGTCCGAAGAGGTTGTCATAGGATTTTCTGA Y CACCACTCAATCATATCTACTTA 
CACAAGCAGTCAAGCAGTCAACAAAGAAGAAATTTCTTTTTTCGGAGACAAAGAGATATTTCACACAGTATAGTTTTGC 
CGGCTGCAGTTTCTTCAGCTCATCCGGTTCCTAAGCACATAAAGAAGCCAGACTATGTGACGACAGGCATTGTACCAGA 
CTGGGGAGACAGCATAGAAGTTAAGAATGAAGATCAGATTCAAGGGCTTCATCAGGCTTGTCAGCTGGCCCGCCACGTC 
CTCCTCTTGGCTGGGAAGAGTTTAAAGGTTGACATGACAACTGAAGAGATAGATGCTCTTGTTCATCGGGAAATCATCA 
GTCATAATGCCTATCCCTCACCTCTAGGCTATGGAGGTTTTCCAAAATCTGTTTGTACCTCTGTAAACAACGTGCTCTG 
TCATGGTATTCCTGACAGTCGACCTCTTCAGGATGGAGATATTATCAACATTGATGTCACAGTCTATTACAATGGCTAC 
CATGGAGACACCTCTGAAACATTTTTGGTGGGCAATGTGGACGAATGTGGTAAAAAGTTAGTGGAGGTTGCCAGGAGGT 
GTAGAGATGAAGCAATTGCAGCTTGCAGAGCAGGGGCTCCCTTCTCTGTAATTGGAAACACAATCAGCCACATAACTCA 
TCAGAATGGTTTTCAAGTCTGTCCACATTTTGTGGGACATGGAATAGGATCTTACTTTCATGGACATCCAGAAATTTGG 
CATCATGCAAACGACAGTGATCTACCCATGGAGGAGGGCATGGCATTCACTATAGAGCCAATCATCACGGAGGGATCCC 
CTGAATTTAAAGTCCTGGAGGATGCATGGACTGTGGTCTCCCTAGACAATCAAAGGTCGGCGCAGTTCGAGCACACGGT 
TCTGATCACGTCGAGGGGCGCGOAGATCCTGACCAAACTACCCCATGAGGCCTGAGGAGCCGCCCGAAGG 



The nucleic acid sequence of NOV1 9a has 469 of 805 bases (58%) identical to a 
Saccharomyces cerevisiae methionine aminopeptidase I (MAPI) mJRNA (gbrGENBANK- 
ID:YSCMAPlA|acc:M77092.1) (E - Lie" 20 ). 

A NOV 19a polypeptide (SEQ ID NO:44) encoded by SEQ ID NO:43 is 315 amino 
acid residues and is presented using the one letter code in Table 19B. Signal P, Psort and/or 
Hydropathy results predict that NOV 19a contains a signal peptide and is likely to be localized 
in the cytoplasm with a certainty of 0.450. 



Table 19B. NOV19a protein sequence (SEQ ID NO:44) 

^plnhiylhkqsssqqrrnfffrr^^ishsivlpaavssahWpkhikkpd^yvttgivpdwgdsievknedqiqglhqacqlT 
arhvlllagkslkvdmtteeidalvhreiishnaypsplgyggfpksv^ 

hgdtsetflvgnvdecgkklvevarrcrdeaiaacragapfsvigntishithqngfqvcphfvghgigsyfhghpeiwhhan 

DSDLPMEEGMAFTI EPIITEGS PEFKVLEDAWTW5LDNQRSA QFEH TVIjITSRGAQI LTKL PHEA 
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15 



20 



The NOV19a amino acid sequence has 143 of 267 amino acid residues (53%) identical 
to, and 173 of 267 amino acid residues (64%) similar to, aArabidopsis thaliana 369 amino 
acid residue putative methionine' aminopeptidase (ptnr:TREMBLNEW-ACC:AAO09564) (E = 
5.7e" 53 ). 

NOV 19a is expressed in at least the following tissues: Adrenal Gland/Suprarenal 
gland, Artery, Bone Marrow, Brain, Colon, Pituitary Glandand Whole Organism. This 
information was derived by determining the tissue sources of the sequences that were ihcluded 
in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE SQurces. 



10 NOV19b 



A disclosed NOV19b nucleic acid of 1018 nucleotides (designated CuraGen Acc. No. 
CG50287-01) encoding a novel Methionin Amihopeptidase-like protein is shown in Table 
19C. An open reading frame was identified beginning with an ATC codon at nucleotides 71- 
73 and ending with a TGA codon at nucleotides 994-996. A putative untranslated region 
downstream from the termination codon is underlined in Table 19C, and the start and stop 
codons are in bold letters. 



Table 19C. NOV19b Nucleotide Sequence (SEQ ID NO:45) 



GGCGCCCAGCGGCTTGCACCTGTTCGTCCGAAGAGGTTGTCATAGGATTTTCTGATCACCACTCAATCAT ATCTACTTA 
CACAAGCAGTCAAGCAGTCAACAAAGAAGAAATTTCTTTTTTCGGAGACAAAGAGATATTTCACACAGTATAGTTTTGC 
CGGCTGCAGTTTCTTCAGCTCATCCGGTTCCTAAGCACATAAAGAAGCCAGACTATGTGACGACAGGCATTGTACCAGA 
CTGGGGAGACAGCATAGAAGTTAAGAATGAAGATCAGATTCAAGGGCTTCATCAGGCTTGTCAGCTGGCCCGCCACGTC 
CTCCTCTTGGCTGGGAAGAGTTTAAAGGTTGACATGACAACTGAAGAGATAGATGCTCTTGTTCATCGGGAAATCATCA 
GTCATAATGCCTATCCCTCACCTCTAGGCTATGGAGGTTTTCCAAAATCTGTTTGTACCTCTGTAAACAACGTGCTCTG 
TCATGGTATTCCTGACAGTCGACCTCTTCAGGATGGAGATATTATCAACATTGAfGTCACAGTCTATTACAATGGCTAC 
CATGGAGACACCTCTGAJ^CATTTTTGGTGGGCAATGTGGACGJ^TGTGGTAAAAAGTTAGTGGAGGTTGCCAGGAGGT 
GTAGAGATG AAGC AATTG C AG CTTGCAG AGCAGGGG CTCCCTTCTCTGT AATTGG AAAC ACAATCAGC CA CATAACT CA 
TCAGAATGGTTTTCAAGTCTGTCCACATTTTGTGGGACATGGAATAGGATCTTACTTTCATGGACATCCAGAAATTTGG 
CATCATGCAAACGACAGTGATCTACCCATGGAGGAGGGCATGGCATTCACTATAGAGCCAATCATCACGGAGGGATCCC 
CTGAATTTAAAGTCCTGGAGGATGCATGGACTGTGGTCTCCCTAGACAATCAAAGGTCGGCGCAGTTCGAGCACACGGT 
TCTGATCACGTCGAGGGGCGCGCAGATCCTGACCAAACTACCCCATGAGGCCTGAGGAGCCGCCCGAAGG 



A NOV19b polypeptide (SEQ ID NO:46) encoded by SEQ ID NO:45 is 310 amino 
acid residues and is presented using the one letter code in Table 19D. 



Table 19D. NOV19b protein sequence (SEQ ID NO;46) 



IYLHKQSSSQQRRNPFFRRQRDISHSIVLPAAVSSAHPVPKHIKKPDYVTTGIVPDWGDSIEVKNEDQIQGLHQACQIiARHVIj 
LLAGKSIjKVDMTTEEIDAbVHREI I SHNAYPSPLGYGGFPKSVCTSVNNVbCHGI PDSRPLQDGDIINIDVTVYYNGYHGDTS 
ETPIiVG^^WDECGKKLVEVARRCRI3EAIAACRAGAPFSVIGNTISHITHONGFQVCPHFVGHGIGSYFHGHPEIWHHANDSDI J P 
MEEGMAFTIEPIITEGSPEFKVLEDAWTWSLDNQRSAQFEHTVLITSRGAQILTKLPHEA 



NOV19c 

A disclosed NOV19c nucleic acid of 3 042 nucleotides (designated CuraGen Acc. No. 
CG50287-03) encoding a novel Methionin Aminopeptidase-like protein is shown in Table 
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19E. An open reading frame was identified beginning with an ATG codon at nucleotides 21- 

23 and ending with a TGA codon at nucleotides 1026-1028. A putative untranslated region 

• | . f 

downstream from the termination codon is underlined in Table 19E, and the start and stop 

codons are in bold letters. 



Table 19E. NOV19c Nucleotide Sequence (SEQ ID NO:47) 



gccacgtg accgacgccaac atggcggcgcccagtggcgtccacctgctcgtccgcagaggttctcatagaattttl'ctc 
ttcaccactcaatcatatctacttacacaagcagtcaagcagtcaacaaagaagaaatttcttttttcggagacaaaga 
gatatttcacacagtatagtttcgccggctgcagtttcttcagctcatccggttcctaagcacataaagaagccagact 
atgtgacgacaggcattgtaccagactggggagacagcatagaagttaagaatgaagatcagattcaagggcttcatca 
ggcttgtcagctggcccgccacgtcctcctcttggctgggaagagtttaaaggttgacatgacaactgaagagatagat 
gctcttgttcatcgggaaatcatcagtcataatgcctatccctcacctctaggctatggaggttttccaaaatctgttt 
gtacctctcJtaaacaacgtgctctgtcatggtattcctgacagtcgacctcttcaggatggagatattatcaacattga 
tgtcacagtctattacaatggctaccatggagacacctctgaaacatttttggtgggcaatgtggacgaatgtggtaaa 
aagttagtggaggttgccaggaggtgtagagatgaagcaattgcagcttgcagagcaggggctcccttctctgtaattg 
gaaacacaatcagccacataactcatcagaatggttttcaagtctgtccacattttgtgggacatggaataggatctta 

CTTTCATGGACATCCAGAAATTTGGCATCATGCA7\ACGACAGTGATCTACCCATGGAGGAGGGCATGGCATTCACTATA 

gagccaatcatcacggagggatcccctgaatttaaagtcctggaggatgcatggactgtggtctccctagacaatcaaa 
ggtcggcgcagttcgagcacacggttctgatcacgtcgaggggcgcgcagjitcctgaccaaactaccccatgaggcctg 
aggagccgcccgaag 



10 



15 



20 



The nucleic acid sequence of NOV19c maps to chromosome 2 and has 532 of 866 
bases (61%) identical to a Arabidopsis thaliana methionine aminopeptidase mRNA 
(gb:GENBAMK-ID:AF250963]acc:AF250963.1) (E = 3.5e" 36 ). 

A NOV19c polypeptide (SEQ ID NO:48) encoded by SEQ ID NO:47 is 335 amino 
acid residues and is presented using the one letter code in Table 19F. Signal P, Psort and/or 
Hydropathy results predict thatNOV19c contains a signal peptide and is likely to be localized 
in the mitochondrial matrix space with a certainty of 0. 4760. 



Table 19F. NOV19c protein sequence (SEQ ID NO:48) 



WAAPSGVHLLVRRGSHRIFSSPLNHIYLHKQSSSQQRRNFPPRRQRDISHSIVSPAAVSSAHPVPKHIKKPDYVTTGIVPDWG 
DSIEVKNEDQIQGbHQACOIARHVLLLAGKSLKVDMTTEEIDALVHREIISHNAYPSPLGYGGFPKSVCTSVNNVLCHGIPDS 
RPLQDGDIINIDVTVYYNGYHGDTSETFLVGNVDECGKKLVEVARRCRDEAIAACRAGAPFSVIGNTISHITHQNGFQVCPHF 
VGHGIGSYFHGHPEIWHHANDSDLPMEEGMAFTIEPIITEGSPEFKVLEDAWTWSLDNQRSAQFEHTVIilTSRGAQILTKLF 
HEA 



TheNOV19c amino acid sequence has 306 of 335 amino acid residues (91%) identical 
to, and 315 of 335 amino acid residues (94%) similar to, a Mus musculus 335 amino acid 
residue 23 1 0066F24RTK protein (ptnr:SPTREl\4BL-ACC:Q9CPW9) (E = 4.4e" 166 ). 

NOV19c is expressed in at least the following tissues: Mammalian Tissue, Artery, 
Adrenal Gland/Suprarenal gland, Colon, Bone Marrow, Brain, Pituitary Gland and Placenta. 
This infonnation was derived by determining the tissue sources of the sequences that were 
included in the invention including but not limited to SeqCalling sources, Public EST sources, 
Literature sources, and/or RACE sources. 

Possible small nucleotide polymorphisms (SNPs) found for NOV19a are listed in 
Table 19G. 
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Table 19G: SNPs 


Variant 


Nucleotide 


Base' 


Amino 


Base 




Position 


Change 


Acid 
Position 


Change 


13376475 


79 


C>T 


Silent 


N/A 


13376474 


330 


T>C 


105 


He > Thr 


13376473 


336 


A > G 


107 


Asn > Ser 


13376472 


472 


A >r G 


Silent 


N/A 


13376471 


764 


G > A 


250 


Val > Met 



NOV 19a - NOV19c are very closely homologous as is shown in the amino acid 
alignment in Table 19H. 

Table 19H Amino Acid Alignment of NOV19a - NOV19c 

20 30 40 50 60 70 
. I I I I i I I I 



NOV19a 
NOV19b 



NOV19a 
NOV1 9b 
NOV19C 



NOV19a 
NOV19b 
NOV19C 



NOV19a 
NOV19b 
NOV19C 



NOV19a 
NOV19b 
NO VI 9c 



|....|. 



SB 



FFPRRQRDISHSI VLPAAVSSAHPVPKHI 
PFPRRQRDISHSIVI»PAaVSSAHPVPKHlKF 
FFFRRQRDISHS I vSPAAVSSAHPVPKHIK* 



80 



I 



90 



100 110 120 

..I 



PDYVTTG t VPD WGDS I EVKffiEDQ I QGLJ3Q ACQLARHVIiljLiAG KSLKVDMTTEE IDALVHRE 1 I SHigAYP S 
PDYVTTOt VPDWGDS lEWKgEDQ IQGLHQACQLARH^^ 1 I ShSaYP S 

PDYVTTGIVPDWGDSXEVkSeDQIQGLHQACQIJU*HVI;LI^ 



130 140 

HT 



150 



160 



I 



170 



180 190 

. . I . . . . I . . . . I . . 



PLGYGGFPKSVCTS 
PLGYGGFPKSVCTS 
PLGYGGFPKSVCTSVT5 



200 210 
. JI....1....1 



HGI PDSRPLQDGDI I SlDVTVyV^YHGDTSETFLVGjSVDECGKKLVEV 
GIPDSRPLQDGDI I BlDVTVYYSGYHGDTSETFLVGSvDECGKKLVEVj 
aVLCHGIPDSRPIjQDGDI I BiDVTVYVIGYHGDTSETFLVgSvDECSKKLVKV 



220 230 240 
■ . 1 1 | | | . . 



250 
..|.. 



RRCRDEAIAACRAGAPFSVIGfflTISHITOQSGFQVCPHFVGHGIGSYFHGHPEIWI 
RRCRDEAI AACRAGAPFSVI G<St ISHITH^GFQVCPHFVGHGIGS YFHGHPE IWH 

rrcrdeaiaacragapfsvigBtishith3Sgfqvcphfvghgigs yfhghpex^ 




290 300 

.1.. ..)....)....).. 



310 320 330 

.. I .... I .... I * ... I I . . 



FTXE PI I TEGS PEFKVLEDAWTWSLDgQRSAQFEHTVLITSRGAQI LTKLPH! 
FTIEPIITEGSPEFKVIjEDAWTVVSLdSqRSAQFEHTVLITSRGAQILTKIiPH 
FTIEPI ITEGS PEFKVLBDAWTWSLdSqRSAQFEHTVLITSRGAQILTKLPHE * 



10 



Homologies to any of the above NOV 19 proteins will be shared by the other NOV 19 
proteins insofar as they are homologous to each other as shown above. Any reference to 
NOV 19 is assumed to refer to any of the NOV19 proteins in general, unless otherwise noted. 

NOV 19a also has homology to the amino acid sequences shown in the BLASTP data 
listed in Table 191. 



Table 19L BLAST results for NOV} 9a 


Gene Index/ 
Identifier 


Protein/ Organism 


Length 
(aa) 


Identity 
(%) 


Positives 
(%) 


Expect 
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gi 1 17975502 | ref |NP 


methionine 
aminopeptidase- 
like 1 [Mus 
musculus] 


3 3 5 


2 /y/ Jib 
(88%) 


286/315 
(90%) 


e-162 


079909.1 | 
(NM_025633) 


o,i 1 1598223 6 jemblCAC 


putative 
methionyl 
aminopept idase 
(Mus musculus] 


217 


204/217 
( 94%) 


207/217 
(95%) 


e-117 


88860. l| (AJ414378) 


gi|l5222218|ref |NP 


methionine 
ammopeptidase I 
(MAPI) , putative 
[Arabidopsis 
thaliana] 


369 


143/267 

IK "3. Sir \ 


173/267 


5e-77 


1 /278b . 1 | 
(NC_003070) 


gi | 11320956 |gb|AAG3 
3975 . 1 f AF250961 1 
(AF250961) 


methionine 
aminopeptidase- 
like protein 
[Arabidopsis 
thaliana] 


369 


143/267 
(53%) 


174/267 
(64%) . 


8e-77 

i 


gi | 7297697 | gb J AAF52 


CG5'18 8 gene 

product 
tDrosophila 
melanogaster] 


307 


145/282 
(51%) 


181/282 
(63%) 


3e-76 


949.1} (AE003628) 



The homology of these sequences is shown graphically in the ClustalW analysis shown 
in Table 191 

Table 193 ClustalW Analysis of NOV19a 

1) NOV19a (SEQ ID NO:44) 

2) eill7975502lreflNP 079909.11 (NMJ)25633) methionine aminopeptidase-like 1 [Mus musculus] (SEQ ID 
NO: 184) 

3) gill5982236lemblCAC88S60.il (AI414378) putative methionyl aminopeptidase [Mus musculus] (SEQ ID 
N0:1S5) 

4) gill 522221 8[ref)NP 172785.11 (NCJ)03070) methionine aminopeptidase I (MAPI), putative [Arabidopsis 
thaliana] (SEQ ID NO: 186) 

5) gill 1320956igblAAG33975.1lAJF250961 1 (AF250961) methionine aminopeptidase-like protein 
[Arabidopsis thaliana] (SEQ ID NO: 187) 

6) gi |72976971 g b|AAF52949 t ll (AE003628) CG5188 gene product [Drosophila melanogaster] (SEQ ID 
NO: 188) 



NOVl9a 



Si 

gi 

gi 



17975502 | 
15982236 j 
15222218| 
11320956 | 
7297697) 



50 



I I I I I — • I * ' ' ♦ I I v..:-' I/.- ■ JL* V • I * • * * I 

_ ^sJl^HIY^KQSSSQOJ^FFPR^RDIgHSIgL- 

-MAAPIGVPLLVRGGCQRIlisS@I^HIY^KRSGSQ 



massvflsspssssslqlcssfhgeyi^psrcflg^gv^sssg 
massvflssfssssslqlcssfhgeylapsrcfrgastysssj 
mg'sglslsgi! 



;SLSG KKN&YSP^FH 

jSLSG KKNSYSPpFH 

|YSDG '- - - -Y;TYgDTGKYEQl2 



NOVl9a 

gi{l7975502( 

gi|l5982236j 

gijlS222218| 

gi| 11320956 | 

gi|7297697| 



.|....|. 



90 



100 



110 



120 



- PAA^SAHFBgKHHKK 



130 



iTjigivg- gWGDS lEVKNEPOgQGiU 
^IVg-^GDSIjEVKDEppgOGki 



LEEAlRIRKMRELETKSKWRNPPLRRGRflffiRLLjJgDHppj; 

tgopHSerfBSeei 



|P^ES^xS-SlSSEFbl : Pt3P^G^KM 
3 p B E !?SvLg-S ISSEF 'Q T ! PGp EGHAJ<M 

fkn^'pSgntlgs p'eiIksqvqsdam 



N0V19a 

gijl7975502 j 
gi|l5982236 j 
gijl5222218| 
gi]ll320956] 
gi|7297697j 



150 



160 



170 



180 



190 



200 



220 



210 




250 



260 
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NOV19a 




NOV19a 
gi | 17975502 | 
gi j 15982236 | 
gi j 15222218 j 
gi j 11320956 | 
gi|7297697j 



NOV19a 
gi j 17975502 
gi [15982236 
gi j 15222218 
gi j 11320956 | 
gi j 7297697 [ 



CDEHDLAS I AAQlj 




Table 19K lists the domain description from DOMAIN analysis results against 
NOV19a. This indicates that the NOV19a sequence has properties similar to those of other 
proteins known to contain these domains. 



Table 19K. Domain Analysis of NOV19a 

gnl 1 Pf amjpf am00557 , Peptidase_M24 , metallopeptidase family M24 (SEQ ID 
NO: 189) 

Length = 243 residues, 98.8% aligned 

Score = 179 bits (454), Expect = 2e-46 



NOV19a: 


68 


00557 : 


2 


NOV19a: 


125 


00557 : 


62 


NOV 19a : 


185 


00557: 


120 


NOV19a: 


243 


00557: 


180 


NOV1 9a : 


303 


00557 : 


240 



VKNEDQIQGbHQACQbARHVbbbAGKSbKVDMTTEEIDAbVHREI ishn-aypsplgy- - 
+K+ + + I+ + +A + +AR V ++ + K MT EI + I A P+ GY 

IKSPEEI EKMRKAGEI ARRVHRAWEAIKPGMTELEIAEEI EYAIRKRGGADPAPYGYIV 

GGFPKSVCTSVNNVbCHGI PDSRPLQDGDI INIDVTVYYNGYHGDTSETFLVGNVDECGK 
GFP S+ SVN + H PD R b+DGDI-f ID Y+GYHGD + TF VG 4 
IGFPTSI - - SVNEAVAHYSPDDRVLKDGDI VLIDAGAEYDGYHGDI ARTFPVGKPTPDAR 



124 



61 



184 



119 



242 



KLVEVARRCRDEAIAACRAGAPFSVIGNTI SHI THONG- - FQVCPHFVGHGIGS YFHGHP 
Kb E ++ AI A + G SI 1+ + H +GHGIG H P 

KLYE AVLE AQEAAI EAI KPGNTbSD I HAA' I QKVAESELGQ CKP VRHGbGHG I GLDVHDVP 179 

EIWHHANDSDbPMEEGMAFTIEPIITEGSPEFKVbEDAWTWSbDNORSAQFEHTVLITS 3 02 

+ + +EEGM FTIEP + G + DW+D SQET++T 

GVPQYDRGDTRVLEEGMVFTIEPGVYFGGVPGRTRGDGWVRIEDDIWSEQGEETbTVTP 23 9 



Methionine aminopeptidase (MetAP-2) cleaves the N-terminal methionine from newly 
synthesized NMT protein substrates. The inhibition of new blood vessel formation 
(angiogenesis) is an effective means of limiting both the size and metastasis of solid tumors. 
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One of the leading angiogenic compounds, fumagillin and its derivative TNP-470, inhibits 

n "* »■ ■ , ' ■ 

neovascularization via endothelial cell cycle arrest in the late 

I t P 

Gl phase. Because TNP-470 proved tq be effective in in vitro and animal model 
studies, phase III antitumor clinical trials were initiated. To investigate the molecular mode of 
5 action of TNP-470, Sin et al. (Sin et al., Proc. Nat. Acad. Sci. 94: 6099-6103, 1997. PubMed 
ID ; 9177176) used a derivative of the TNP-470 parent compound, the fungal metabolite 
fumagillin, and purified a mammlalian protein from bovine brain lys ate that is selectively arid 

covalently bound by this natural product. This fumagillin binding protein was found to be a 

i 

metalloprotease, methionine aminopeptidase-2 (MetAP2), which is highly conserved between 
10 human and |Saccharomyces cerevisiae. In the absence of MetAPl, a distantly related 

methionine aminopeptidase, MetAP2 function is essential for vegetative growth in yeast. Sin 
et al. (1997) demonstrated that fumagillin selectively inhibits the S. cerevisiae MetAP2 protein 

in vivo. The binding is highly specific as judged by the failure of fumagillin to inhibit MetAPl 

i 

in vivo. 

1 5 Fumagillin and ovalicin are structurally related natural products that potently inhibit 

angiogenesis by blocking endothelial cell proliferation. To determine the structural elements 
of these inhibitors and methionine aminopeptidase-2 that are involved in this interaction, 
Griffith et al. (Griffith et al., Proc. Nat. Acad. Sci. 95: 15183-15188, 1998. PubMed ID : 
9860943) studied various fumagillin analogs with structural changes. Their results suggested 

20 that fumagillin and ovalicin inhibit MetAP2 by irreversible blockage of the active site. 

Therefore, the novel protein described in this invention is apotential small molecule drug 
target in tumor angiogenesis. 

The N-terminal protein processing pathway is an essential mechanism found in all 
organisms. However, it is widely believed that deformylase, a key enzyme involved in this 

25 process in bacteria, does not exist in eukaryotes, thus making it a target for antibacterial agents 
such as actinonin. In an attempt to define this process in higher eukaryotes Giglione et al. 
(EMBO J 2000;19:5916-29) have used Arabidopsis thaliana as a model organism. Two 
deformylase cDNAs, the first identified in any eukaryotic system, and six distinct methionine 
aminopeptidase cDNAs were cloned. The corresponding proteins were characterized in vivo 

30 and in vitro. Methionine aminopeptidases were found in the cytoplasm and in the organelles, 
while deformylases were localized in the organelles only. Their work shows that higher plants 
have a much more complex machinery for methionine removal than previously suspected. 
They were also able to identify deformylase homologues from several animals and clone the 
corresponding cDNA from human cells. Their data provide the first evidence that lower and 
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higher eukaryotes, as well as bacteria, share a similar N-terminal protein processing 
machinery, indicating universality of this system. 

The above defined information for NOV19 suggests that this NOV19 protein may 
function as a member of a Methionin Ajninopeptidase protein family. Therefore, the NOV 19 
5 nucleic acids and proteins of the invention are useful in potential therapeutic and diagnostic 
applications. For example, a cDNA encoding thp NOV 19 protein may be useful in gene 
therapy, and the NOV1 9 protein may be useful when administered to a subject in need thereof. 
By way of nonlimiting example, the compositions of the present invention will have efficacy 
for treatment of patients suffering from adrenoleuko dystrophy, congenital adrenal hyperplasia, 

1 0 atherosclerosis, aneurysm, hypertension, fibromuscular dysplasia, scleroderma, 

transplantation, hemophilia, hypercoagulation, idiopathic thrombocytopenic purpura, 
1 autoimmume disease, allergies, immunodeficiencies, graft versus host disease, Von Hippel- 
Lindau (VHL) syndrome, Alzheimer's disease, stroke, tuberous sclerosis, hypercalcemia, 
Parkinson's disease, Huntington's disease, cerebral palsy, epilepsy, Lesch-Nyhan syndrome, 

15 multiple sclerosis, ataxia-telangiectasia, leukodystrophies, behavioral disorders, addiction, 
anxiety, pain, neuroprotection, Hirschsprung's disease, Crohn's Disease, appendicitis, 
endocrine dysfunctions, diabetes, obesity, growth and reproductive disorders. The NOV 19 
nucleic acid encoding Methionin Aminopeptidase-like protein, and the Methionin 
Aminopeptidase-like protein of the invention, or fragments thereof, may further be useful in 

20 diagnostic applications, wherein the presence or amount of the nucleic acid or the protein are 
to be assessed. 

NOVX Nucleic Acids and Polypeptides 

One aspect of the invention pertains to isolated nucleic acid molecules that encode 
NOVX polypeptides or biologically active portions thereof. Also included in the invention are 

25 nucleic acid fragments sufficient for use as hybridization probes to identify NO VX-encoding 
nucleic acids (e.g., NOVX mRNAs) and fragments for use as PCR primers for the 
amplification and/or mutation of NOVX nucleic acid molecules. As used herein, the term 
"nucleic acid molecule" is intended to include DNA molecules (e.g., cDNA or genomic 
DNA), RNA molecules (e.g., mRNA), analogs of the DNA or RNA generated using 

30 nucleotide analogs, and derivatives, fragments and homologs thereof. The nucleic acid 
molecule may be single- stranded or double-stranded, but preferably is comprised double- 
stranded DNA. 

An NOVX nucleic acid can encode a mature NOVX polypeptide. As used herein, a 
"mature" form of a polypeptide or protein disclosed in the present invention is the product of a 
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naturally occurring polypeptide or precursor form or proprotein. The naturally occurring 
polypeptide, precursor or proprotein includes, by way of nonlimiting example, the full-length 
gene product, encoded by the corresponding gene. Alternatively, it may be'defined as the 
polypeptide, precursor or proprotein encoded by an ORF described herein. The product 
5 "mature" form arises, again by way of nonlimiting example, as a result of one or more 
naturally occurring processing steps as they may take place within the cell, or host cell, in 
which the gene product arises. Examples 'of such processing steps leading to a "mature" form 
of a polypeptide or protein include the cleavage of the N-terminal methionine residue encoded 
by the initiation codon of an ORF, ,or the proteolytic cleavage of a signal peptide or leader 
10 sequence. Thus a mature form arising from a precursor polypeptide or protein that has 

residues 1 to N, where residue 1 is the N-terminal methionine, would have residues 2 through 
N remaining after removal of the N-terminal methionine. Alternatively, a mature form arising 
from a precursor polypeptide or protein having residues 1 to N, in which an N-terminal signal 
sequence horn residue 1 to residue M is cleaved, would have the residues from residue M-H to 

1 5 residue N remaining. Further as used herein, a "mature" form of a polypeptide or protein may 
arise from a step of post-translational modification other than a proteolytic cleavage event. 
Such additional processes include, by way of non -limiting example, glycosylation, 
myristoylation or phosphorylation. In general,- a mature polypeptide or protein may result 
from the operation of only one of these processes, or a combination of any of them. 

20 The term "probes", as utilized herein, refers to nucleic acid sequences of variable 

length, preferably between at least about 1 0 nucleotides (nt), 100 nt, or as many as 
approximately, e.g., 6,000 nt, depending upon the specific use. Probes are used in the 
detection of identical, similar, or complementary nucleic acid sequences. Longer length 
probes are generally obtained from a natural or recombinant source, are highly specific, and 

25 much slower to hybridize than shorter-length oligomer probes. Probes may be single- or 
double-stranded and designed to have specificity in PCR, membrane-based hybridization 
technologies, or ELISA-like technologies. 

The term "isolated" nucleic acid molecule, as utilized herein, is one, which is separated 
from other nucleic acid molecules which are present in the natural source of the nucleic acid. 

30 Preferably, an "isolated" nucleic acid is free of sequences which naturally flank the nucleic 
acid (i.e., sequences located at the 5'- and 3'-termini of the nucleic acid) in the genomic DNA 
of the organism from which the nucleic acid is derived. For example, in various embodiments, 
the isolated NOVX nucleic acid molecules can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 
kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the nucleic acid molecule in 
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genomic DNA of the cell/tissue from which the nucleic acid is derived (e.g., brain, heart, liver, 
spleen, etc.). Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can ■ 
be substantially free of other cellular material or culture medium when produced by 
recombinant techniques, or of chemical precursors or other chemicals when chemically 
5 synthesized. 

A nucleic acid molecule of the invention, e.g., a nucleic acid molecule having the 
nucleotide sequence SEQ ID NOS:l, 3, 5, 7, 9, 11,-13, 15, 17,'19, 21, 23, 25, 27, 29, 31, 33, 
34, 35, 37, 39, 41, 43, 45, 47 and 49, or a complement of this aforementioned nucleotide 
sequence, can be isolated using standard molecular biology techniques and the sequence 
10 information provided herein. Using all or a portion of the nucleic acid sequence of SEQ ID 
NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 
and 49 as a hybridization probe, NOVX molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described in Sambrook, et aL, (eds.), 
Molecular Cloning: A Laboratory Manual 2 nd Ed., Cold Spring Harbor Laboratory 

15 Press, Cold Spring Harbor, NY, 1989; and Ausubel, et al., (eds.), Current PROTOCOLS IN 
Molecular Biology, John Wiley & Sons, New York, NY, 1993.) 

A nucleic acid of the invention can be amplified using cDNA, niRNA or alternatively, 
genomic DNA, as a template and appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so amplified can be cloned into an 

20 appropriate vector and characterized by DNA sequence analysis. Furthermore, 

oligonucleotides corresponding to NOVX nucleotide sequences can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the term "oligonucleotide" refers to a series of linked nucleotide 
residues, which oligonucleotide has a sufficient number of nucleotide bases to be used in a 

25 PCR reaction. A short oligonucleotide sequence may be based on, or designed from, a 
genomic or cDNA sequence and is used to amplify, confirm, or reveal the presence of an 
identical, similar or complementary DNA or RNA in a particular cell or tissue. 
Oligonucleotides comprise portions of a nucleic acid sequence having about 10 nt, 50 nt, or 
100 nt in length, preferably about 15 nt to 30 nt in length. In one embodiment of the 

30 invention, an oligonucleotide comprising a nucleic acid molecule less than 100 nt in length 

would further comprise at least 6 contiguous nucleotides SEQ ID NOSrl, 3, 5, 7, 9, 11, 13, 15, 
17, 19, 21 , 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, or a complement 
thereof. Oligonucleotides may be chemically synthesized and may also be used as probes. 
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In another embodiment, an isolated nucleic acid molecule of the invention comprises a 
nucleic acid molecule that is a complement of the nucleotide sequence shown in SEQ ID 
NOS:l 5 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21,' 23, 25, 27, 29, 31, 33, H -35, 37, 39, 41, 43, 45, 47 
and 49, or a portion of this nucleotide sequence (e.g., a fragment that can be used as a probe or 
5 primer or a fragment encoding a biologically-active portion of an NOVX polypeptide). A 
nucleic acid molecule that is complementary to the nucleotide sequence shown SEQ ID 
NOS:l, 3, 5, 7, 9, 11,13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 
and 49 is one that is sufficiently complementary to the nucleotide sequence shown NOSil, 3, 
5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23,' 25, 27, 29, 31, 33, 35, 37, 39 or 41 that it can hydrogen bond 
10 with little or no mismatches to the nucleotide sequence shown SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, thereby forming 
'a stable duplex. 

As used herein, the term "complementary" refers to Watson-Crick or Hoogsteen base 
pairing between nucleotides units of a nucleic acid molecule, and the term "binding" means 

1 5 the physical or chemical interaction between two polypeptides or compounds or associated 
polypeptides or compounds or combinations thereof. Binding includes ionic, non-ionic, van 
der Waals, hydrophobic interactions, and the like. A physical interaction can be either direct 
or indirect. Indirect interactions may be through or due to the effects of another polypeptide or 
compound. Direct binding refers to interactions that do not take place through, or due to, the 

20 effect of another polypeptide or compound, but instead are without other substantial chemical 
intermediates. 

Fragments provided herein are defined as sequences of at least 6 (contiguous) nucleic 
acids or at least 4 (contiguous) amino acids, a length sufficient to allow for specific 
hybridization in the case of nucleic acids or for specific recognition of an epitope in the case of 

25 amino acids, respectively, and are at most some portion less than a full length sequence. 
Fragments may be derived from any contiguous portion of a nucleic acid or amino acid 
sequence of choice. Derivatives are nucleic acid sequences or amino acid sequences formed 
from the native compounds either directly or by modification or partial substitution. Analogs 
are nucleic acid sequences or amino acid sequences that have a structure similar to, but not 

30 identical to, the native compound but differs from it in respect to certain components or side 
chains. Analogs may be synthetic or from a different evolutionary origin and may have a 
similar or opposite metabolic activity compared to wild type. Homologs are nucleic acid 
sequences or amino acid sequences of a particular gene that are derived from different species. 
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Derivatives and analogs may be full length or other than full length, if the derivative or 
analog contains a modified nucleic acid or amino acid, as described below.. Derivatives or 
analogs of the nucleic acids or proteins of the invention include, but are ndt limiteid to, 
molecules comprising regions that are substantially homologous to the nucleic acids or 
5 proteins of the invention, in various embodiments, by at least about 70%, 80%, or 95% 

identity (with a preferred identity of 80-95%) over a nucleic acid or amino acid sequence of 
identical size or when compared to an aligned sequence 1 in which the alignment is done by- a 
computer homology program known in the art, or whose encoding nucleic acid is capable of 
hybridizing to' the complement of a.sequence encoding the aforementioned proteins under 

1 0 stringent, moderately stringent, or low stringent conditions. See e.g. Ausubel, et aL, CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York, NY, 1993, and below. 

A "homologous nucleic acid sequence" c-r "homologous amino acid sequence," or 
variations thereof, refer to sequenqes characterized by a homology at the nucleotide level or 
amino acid level as discussed above. Homologous nucleotide sequences encode those 

15 sequences coding for isoforms of NOVX polypeptides. Isoforms can be expressed in different 
tissues of the same organism as a result of, for example, alternative splicing of RNA. 
Alternatively, isoforms can be encoded by different genes, In the invention, homologous 
nucleotide sequences include nucleotide sequences encoding for an NOVX polypeptide of 
species other than humans, including, but not limited to: vertebrates, and thus can include, e.g., 

20 frog, mouse, rat, rabbit, dog, cat cow, horse, and other organisms. Homologous nucleotide 
sequences also include, but are not limited to, naturally occurring allelic variations and 
inutations of the nucleotide sequences set forth herein. A homologous nucleotide sequence 
does not, however, include the exact nucleotide sequence encoding human NOVX protein. 
Homologous nucleic acid sequences include those nucleic acid sequences that encode 

25 conservative amino acid substitutions (see below) in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, as well as a polypeptide 
possessing NOVX biological activity. Various biological activities of the NOVX proteins are 
described below. 

An NOVX polypeptide is encoded by the open reading frame ("ORF") of an NOVX 
30 nucleic acid. An ORF corresponds to a nucleotide sequence that could potentially be translated 
into a polypeptide. A stretch of nucleic acids comprising an ORP is uninterrupted by a stop 
codon. An ORP that represents the coding sequence for a full protein begins with an ATG 
"start" codon and terminates with one of the three "stop" codons, namely, TAA, TAG, or 
TGA. For the purposes of this invention, an ORF may be any part of a coding sequence, with 
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or without a start codon, a stop codon, or both. For an ORF to be considered as a good 
candidate for coding for a bona fi.de cellular protein, a minimum size requirement is often set, 
e.g., a stretch of DNA that would encode a protein of 50 amino acids or mofe. 

The nucleotide sequences determined from the cloning of the human NOVX genes 
5 allows for the generation of probes and primers designed for use in identifying and/or cloning 
NOVX homologues in other cell types, e.g. from other tissues, as well as NOVX homologues 
from other vertebrates. The prot}e/primer typically comprises substantially purified ' ' 
oligonucleotide. The oligonucleotide typically comprises a region of nucleotide sequence that 
hybridizes under stringent conditions to at least about 12, 25, 50, 100, 150, 200, 250, 300, 350 

10 or 400 consecutive sense strand nucleotide sequence SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49; or an anti-sense strand 
nucleotide sequence of SEQ ID NOS:l, 3, 5, 7,9, 11, 13, 15, 17, \9 9 21, 23, 25, 27, 29, 31,33, 
34, 35, 37, 39, 41, 43, 45, 47 and 49; or of a naturally occurring mutant of SEQ ID NOS:l, 3, 
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 3&, 41, 43, 45,' 47 and 49. 

1 5 Probes based on the human NOVX nucleotide sequences can be used to detect 

transcripts or genomic sequences encoding the same or homologous proteins. In various 
embodiments, the probe further comprises a label group attached thereto, e.g. the label group 
can be a radioisotope, a fluorescent compound,, an enzyme, or an enzyme co^factor. Such 
probes can be used as a part of a diagnostic test kit for identifying cells or tissues which mis- 

20 express an NOVX protein, such as by measuring a level of an NOVX-encoding nucleic acid in 
a sample of cells from a subject e.g., detecting NOVX mRNA levels or determining whether a 
genomic NOVX gene has been mutated, or deleted. 

"A polypeptide having a biologically-active portion of an NOVX polypeptide" refers 
to polypeptides exhibiting activity similar, but not necessarily identical to, an activity of a 

25 polypeptide of the invention, including mature forms, as measured in a parti cular biological 
assay, with or without dose dependency. A nucleic acid fragment encoding a "biologically- 
active portion of NOVX" can be prepared by isolating a portion SEQ ID NOS:l, 3, 5, 7, 9, 11, 
13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, that encodes a 
polypeptide having an NOVX biological activity (the biological activities of the NOVX 

30 proteins are described below), expressing the encoded portion of NOVX protein (e.g., by 

recombinant expression in vitro) and assessing the activity of the encoded portion of NOVX. 
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NOVX Nucleic Acid and Polypeptide Variants 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequences shown in SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 
29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49 due to degeneracy of the genetic code and thus 
5 encode the same NOVX proteins as that encoded by the nucleotide sequences shown in SEQ 
ED NOSrl, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21; 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 
47 and 49. hi another embodiment, an isolated nucleic acid molecule of the invention has a 
nucleotide sequence encoding a protein having an amino acid sequence shown in SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. 

10 In addition to the human NOVX nucleotide sequences shown in SEQ DD NOS:l, 3, 5, 

7, 9, 11, 13, 15, 17, 19,21,23,25, 27, 29,31,33,34, 35, 37, 39,41,43,45,47 and 49, it will 
be appreciated by those skilled in the art that DNA sequence polymorphisms that lead to 
changes in the amino acid sequences of the NOVX polypeptides.may exist within a population 
(e.g., the human population). Such genetic polymorphism in the NOVX genes may exist 

15 among individuals within a population due to natural allelic variation. As used herein, the 
terms "gene" and "recombinant gene" refer to nucleic acid molecules comprising an open 
reading frame (ORF) encoding an NOVX protein, preferably a vertebrate NOVX protein. 
Such natural allelic variations can typically result in 1-5% variance in the nucleotide sequence 
of the NOVX genes. Any and all such nucleotide variations and resulting amino acid 

20 polymorphisms in the NOVX polypeptides, which are the result of natural allelic variation and 
that do not alter the functional activity of the NOVX polypeptides, are intended to be within 
the scope of the invention. 

Moreover, nucleic acid molecules encoding NOVX proteins from other species, and 
thus that have a nucleotide sequence that differs from the human SEQ ID NOS:l, 3, 5, 7, 9, 

25 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49 are intended 
to be within the scope of the invention. Nucleic acid molecules corresponding to natural 
allelic variants and homologues of the NOVX cDNAs of the invention can be isolated based 
on their homology to the human NOVX nucleic acids disclosed herein using the human 
cDNAs, or a portion thereof, as a hybridization probe according to standard hybridization 

30 techniques under stringent hybridization conditions. 

Accordingly, in another embodiment, an isolated nucleic acid molecule of the 

invention is at least 6 nucleotides in length and hybridizes under stringent conditions to the 

nucleic acid molecule comprising the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 

13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49. In another 
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embodiment, the nucleic acid is at least 10, 25, 50, 100, 250, 500, 750, 1000, 1500, or 2000 or 
more nucleotides in length. In yet another embodiment, an isolated nucleic acid molecule of ■ 
the invention hybridizes to the coding region. As used herein, the'term "hybridizes under 
stringent conditions" is intended to describe conditions for hybridization and washing under 
5 which nucleotide sequences at least 60% homologous to each other typically remain 
hybridized to each other. 

Homologs {i.e., nucleic acids encoding NOVX proteins derived from species other 
than human) or other related sequences (e.g., paralogs) can be obtained by low, moderate or 
high stringency hybridization wi'th all or a portion of the particular human sequence as a probe 

10 using methods well known in the art for nucleic acid hybridization and cloning. 

As used herein, the phrase "stringent hybridization conditions" refers to conditions 
1 under which a probe, primer or oligonucleotide will hybridize to its target sequence, but to no 
other sequences. Stringent conditions are sequence-dependent and will be different in 
different circumstances. Longer sequences hybridize specifically at higher temperatures than 

15 shorter sequences. Generally, stringent conditions are selected to be about 5 °C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. The 
Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target sequence hybridize to the target 
sequence at equilibrium. Since the target sequences are generally present at excess, at Tm, 

20 50% of the probes are occupied at equilibrium. Typically, stringent conditions will be those in 
which the salt concentration is less than about 1.0 M sodium ion, typically about 0.01 to L0 M 
sodium ion (or other salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short 
probes, primers or oligonucleotides (e.g., 10 nt to 50 nt) and at least about 60°C for longer 
probes, primers and oligonucleotides. Stringent conditions may also be achieved with the 

25 addition of destabilizing agents, such as formamide. 

Stringent conditions are known to those skilled in the art and can be found in Ausubel, 
et aL 9 (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. 
(1989), 6.3.1-6.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98%, or 99% homologous to each other typically remain 

30 hybridized to each other. A non-limiting example of stringent hybridization conditions are 
hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 
EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 mg/ml denatured salmon sperm DNA 
at 65°C, followed by one or more washes in 0.2X SSC, 0.01% BSA at 50°C. An isolated 
nucleic acid molecule of the invention that hybridizes under stringent conditions to the 
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sequences SBQ ID N0S:1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 
39, 41, 43, 45, 47 and 49, corresponds to a naturally-occurring nucleic acid molecule. As used 
herein, a "naturally-occurring" nucleic acid molecule refers to an RNA or t)NA molecule 
having a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 
5 In a second embodiment, a nucleic acid sequence that is hybridizable to the nucleic 

acid molecule comprising the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 34,' 35, 37, 3$, 41, 43, 45,' 47 and 49, or fragments, analogs or 
derivatives thereof, under conditions of moderate stringency is provided. A non- limiting 
example of moderate stringency hybridization conditions are hybridization in 6X SSC, 5X 

10 Denhardt's solution, 0.5% SDS and 100 mg/ml denatured salmon sperm DNA at 55°C, 

followed by one or more washes in IX SSC, 0.1% SDS at 37°C. Other conditions of moderate 
stringency that may be used are well-known within the art. See, e.g., Ausubel, et al (eds.), 
1993, Current Protocols in Molecular Biology, John Wiley & Sons, NY, and Kriegler, 
1990; Gene Transfer and Expression, A Laboratory Manual, Stockton iPress, NY. 

15 In a third embodiment, a nucleic acid that is hybridizable to the nucleic acid molecule 

comprising the nucleotide sequences SEQ JD NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 
27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, or fragments, analogs or derivatives 
thereof, under conditions of low stringency, is provided. A non-limiting example of low 
stringency hybridization conditions are hybridization in 35% formamide, 5X SSC, 50 rnM 

20 Tris-HCl (pH 7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 1 00 mg/ml denatured 
salmon sperm DNA, 10% (wt/vol) dextran sulfate at 40°C, followed by one or more washes in 
2X SSC, 25 mM Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS at 50°C. Other conditions of 
low stringency that maybe used are well known in the art (e.g., as employed for cross-species 
hybridizations). See, e.g., Ausubel, et al (eds.), 1993, CURRENT PROTOCOLS IN MOLECULAR 

25 Biology, John Wiley & Sons, NY, and Kriegler, 1 990, Gene Transfer and Expression, A 
Laboratory Manual, Stockton Press, NY; Shilo arid Weinberg, 1981 . Proc Natl Acad Sci 
USA 78: 6789-6792. 

Conservative Mutations 

In addition to naturally-occurring allelic variants of NOVX sequences that may exist in 
30 the population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into the nucleotide sequences SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, thereby leading to changes in the 
amino acid sequences of the encoded NOVX proteins, without altering the functional ability of 
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said NOVX proteins. For example, nucleotide substitutions leading to amino acid 
substitutions at "non-essential" amino acid residues can be made in the sequence SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 2i, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. 
A "non-essential" amino acid residue is a residue that can' be altered from the wild-type 
5 sequences of the NOVX proteins without altering their biological activity, whereas an 

"essential" amino acid residue is required for such biological activity. For example, amino 
acid residues that are conserved among the NOVX proteins of the invention are predicted to be 
particularly non-amenable to alteration. Amino acids for which conservative substitutions can 
be made are well-known within the art. 

1 0 Another aspect of the invention pertains to nucleic acid molecules encoding NOVX 

proteins that contain changes in amino acid residues that are not essential for activity. Such 
1 NOVX proteins differ in amino acid sequence from SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49 yet retain biological 
activity. In one embodiment, the isolated nucleic acid molecule comprises a nucleotide 

15 sequence encoding a protein, wherein the protein comprises an amino acid sequence at least 
about 45% homologous to the amino acid sequences SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. Preferably, the protein 
encoded by the nucleic acid molecule is at least about 60% homologous to SEQ ID NOS:2, 4, 
6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50; more 

20 preferably at least about 70% homologous SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 

24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50; still more preferably at least about 80% 
homologous to SEQ ED NOS:2 ? 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 
40, 42, 44, 46, 48 and 50; even more preferably at least about 90% homologous to SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50; 

25 and most preferably at least about 95% homologous to SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. 

An isolated nucleic acid molecule encoding an NOVX protein homologous to the 
protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 
44, 46, 48 and 50 can be created by introducing one or more nucleotide substitutions, additions 
30 or deletions into the nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 
23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49, such that one or more amino acid 
substitutions, additions or deletions are introduced into the encoded protein. 

Mutations can be introduced into SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 

25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49 by standard techniques, such as 
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site-directed mutagenesis and PCR-mediated mutagenesis. Preferably,' conservative amino 
acid substitutions are made at one or more predicted, non-essential amino acid residues. A 
"conservative amino acid substitution" is one in which the amino acid residue is replaced with 
an amino acid residue having a similar side chain. Families of amino acid residues having 
5 similar side chains have been defined within the art. These families include amino acids with 
basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chain's (e.g., glycine, asparagine, glutamine, serine, , 
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine,, tryptophan), beta-branched side chains (e.g., threonine, 

10 valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, 
histidine). Thus, a predicted non-essential amino acid residue in the NOVX protein is 
replaced with another amino acid residue from the same side chain 'family. Alternatively, in 
another embodiment, mutations can be introduced randomly along all or part of an NOVX 
coding sequence, such as by saturation mutagenesis, and the resultant mutants can be screened 

1 5 for NOVX biological activity to identify mutants that retain activity. Following mutagenesis 
SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 
45, 47 and 49, the encoded protein can be expressed by any recombinant technology known in 
the art and the activity of the protein can be determined. 

The relatedness of amino acid families may also be determined based on side chain 

20 interactions. Substituted amino acids may be fully conserved "strong" residues or fully 

conserved "weak" residues. The "strong" group of conserved amino acid residues maybe any 
one of the following groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW, 
wherein the single letter amino acid codes are grouped by those amino acids that may be 
substituted for each other. Likewise, the "weak" group of conserved residues may be any one 

25 of the following: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK, 
VLDVf, HFY, wherein the letters within each group represent the single letter amino acid code. 

In one embodiment, a mutant NOVX protein can be assayed for (/) the ability to form 
protein :protein interactions with other NOVX proteins, other cell-surface proteins, or 
biologically-active portions thereof, (ii) complex formation between a mutant NOVX protein 

30 and an NOVX ligand; or (Hi) the ability of a mutant NOVX protein to bind to an intracellular 
target protein or biologically-active portion thereof; (e.g. avidin proteins). 

In yet another embodiment, a mutant NOVX protein can be assayed for the ability to 
regulate a specific biological function (e.g., regulation of insulin release). 
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Antisense Nucleic Acids 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules 
that are hybridizable to or complementary to the nucleic acid molecule comprising the 
nucleotide sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 
5 34, 35, 37, 39, 41, 43, 45, 47 and 49, or fragments, analogs or derivatives thereof. An 

"antisense" nucleic acid comprises a nucleotide sequence that is complementary to a "sense" 
nucleic acid encoding a protein (e.g., complementary to the coding strand of a double- stranded 
cDNA molecule or complementary to an mRNA sequence). In specific aspects, antisense 
nucleic acid molecules are provided that comprise a sequence complementary to at least about 

10 10, 25, 50, ,100, 250 or 500 nucleotides or an entire NOVX coding strand, or to only a portion 
thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs of an 
NOVX protein of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 1 8, 20, 22, 24, 26, 28, 30, 32, 36, 38, 
40, 42, 44, 46, 48 and 50, or antisense nucleic acids complementary to an NOVX nucleic acid 
sequence of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 

15 39, 41 , 43, 45, 47 and 49, are additionally'provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding an NOVX protein. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons which are 
translated into amino acid residues. In another embodiment, the antisense nucleic acid 

20 molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence 

encoding the NOVX protein. The term "noncoding region" refers to 5' and 3' sequences which 
flank the coding region that are not translated into amino acids (z.e., also referred to as 5' and 
3' untranslated regions). 

Given the coding strand sequences encoding the NOVX protein disclosed herein. 

25 antisense nucleic acids of the invention can be designed according to the rules of Watson and 
Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary 
to the entire coding region of NOVX mRNA, but more preferably is an oligonucleotide that is 
antisense to only a portion of the coding or noncoding region of NOVX mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

30 start site of NOVX mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 

20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention 

can be constructed using chemical synthesis or enzymatic ligation reactions using procedures 

known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) 

can be chemically synthesized using naturally-occurring nucleotides or variously modified 
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nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids (e.g., 
phosphorothioate derivatives and acridihe substituted nucleotides can be used). 

Examples of modified nucleotides that can be used to generate the antisense nucleic 
acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, 
xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, ,dihydrouracil, beta-D-galactosylqueosine, 
inosine, N6-i sop entenyl adenine, 1 -methyl guanine, 1-methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6~adenine, 
V-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethy]uracil, 5-methoxyuracil, 

2- methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5~methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding an NOVX protein to thereby inhibit expression of the protein (e.g., by 
inhibiting transcription and/or translation). The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, for example, in the case of an 
antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface 
(e.g., by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell 
surface receptors or antigens). The antisense nucleic acid molecules can~also be delivered to 
cells using the vectors described herein. To achieve sufficient nucleic acid molecules, vector 
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constructs in which the antisense nucleic acid molecule is placed under the control of a strong 
pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid rriol'ecule of the invention is an 
oc-anomeric nucleic acid molecule. An a~anomeric nucleic acid molecule forms specific 
5 double-stranded hybrids with complementary RNA in which, contrary to the usual P-units, the 
strands run parallel to each other. See, e.g., Gaultier, et al, 1987. Nucl Acids Res. 15: 
6625-6641. The antisense nucleic acid molecule can also comjprise a 

2-o-methylribonucleotide (See, e.g., Inoue, et al. 1987. Nucl. Acids Res. 15: 6131-6148) or a 

chimeric RNA-DNA analogue (See, e.g., Inoue, et al, 1987. FEES Lett 215: 327-330. 

i i 

10 Rjbozymes and PNA Moieties 

, . Nucleic acid modifications include, by way of non-limiting example, modified bases, 
and nucleic acids whose sugar phosphate backbones are modified or derivatized. These 
modifications are carried out at least in part to enhance the chemical stability of the modified 

nucleic acid, such that they may be used, for example, as antisense binding nucleic acids in 

i 

15 therapeutic applications in a subject. 

In one embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of 
cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes as described in 

>0 Haselhoff and Gerlach 1988. Nature 334: 585-591) can be used to catalytically cleave NOVX 
mRNA transcripts to thereby inhibit translation of NOVX mRNA. A ribozyme having 
specificity for an NOVX-encoding nucleic acid can be designed based upon the nucleotide 
sequence of an NOVX cDNA disclosed herein (i.e., SEQ ID NOS:l, 3, 5, 7, 9, 1 1, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49). For example, a derivative 

15 of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence to be cleaved in an NOVX-encoding 
mRNA. See, e.g., U.S. Patent 4,987,071 to Cech, et al and U.S. Patent 5,1 16,742 to Cech, et 
al. NOVX mRNA can also be used to select a catalytic RNA having a specific ribonuclease 
activity from a pool of RNA molecules. See, e.g., Bartel et al, (1993) Science 261 : 141 1-1418. 

\0 Alternatively, NOVX gene expression can be inhibited by targeting nucleotide 

sequences complementary to the regulatory region of the NOVX nucleic acid (e.g., the NOVX 
promoter and/or enhancers) to form triple helical structures that prevent transcription of the 
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NOVX gene in target cells. See, e.g., Helene, \99\. Anticancer Dnig Des. 6: 569-84; Helene, 
et al. 1992. Ann. N.Y. Acad. Sci. 660: 27-36; Maher, 1992. Bioassays 14: 8.07-15. 

In various embodiments, 'the NOVX nucleic acids can be modified at the base moiety, 
sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility 
5 of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can 
be modified to generate peptide nucleic acids. See, e.g., Hyrup, et al., 1996. BioorgMed 
Chem 4: 5-23. As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic 
acid mimics {e.g., DNA mimics) in which the deoxyribose phosphate backbone is replaced by 
a pseudopepticle backbone and only the four natural nucleobases are retained. The neutral 
10 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be.performed using 
standard solid phase peptide synthesis protocol^ as described in Hyrup, et ah, 1996. supra; 

Perry-O'Keefe, et al, 1996. Proc. Natl. Acad. Sci. USA 93: 14670-14675. 

i 

PNAs of NOVX can be used in therapeutic and diagnostic application^. For example, 
15 PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 

expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs 
of NOVX can also be used, for example, in the analysis of single base pair mutations in a gene 
{e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination 
with other enzymes, e.g., Si nucleases {See, Hyrup, et al, \996.supra); or as probes or primers 
20 for DNA sequence and hybridization (See, Hyrup, et al., 1996, supra; Perry-O'Keefe, et al., 
1996. supra). 

In another embodiment, PNAs of NOVX can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 

25 delivery known in the art. For example, PNA-DNA chimeras of NOVX can be generated that 
may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes (e.g., RNase H and DNA polymerases) to interact with the DNA portion 
while the PNA portion would provide high binding affinity and specificity. PNA-DNA 
chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, 

30 number of bonds between the nucleobases, and orientation (see, Hyrup, et al., 1996. supra). 
The synthesis of PNA-DNA chimeras can be performed as described in Hyrup, et al., 1996. 
supra and Finn, et al, 1996. Nucl Acids Res 24: 3357-3363. For example, a DNA chain can 
be synthesized on a solid support using standard phosphoramidite coupling chemistry, and 
modified nucleoside analogs, e.g., 5 , -(4-methoxytrityl)amino-5 -deoxy- thymidine 

166 

B NSDOC ID: <WO„ 02055704 A2^_ l_ ; 



WO 02/055704 PCT/US02/00554 
phosphoramidite, can be used between the PNA and the 5' end of DNA^ See, e.g., Mag, et ah, 
1989. Nucl Acid Res 17: 15973-5988. PNA monomers are then coupled in a. stepwise manner 
to produce a chimeric molecule with a 5' PNA segment zbd a 3' DNA segment. See, e.g., 
Finn, et ah, 1996. supra. Alternatively, chimeric molecules can be synthesized with a 5' DNA 
5 segment and a 3' PNA segment. See, e.g., Petersen, et ah, 1975. Bioorg. Med. Chem. Lett. 5: 
1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across 
the cell membrane (see, e.g., Letsinger, et ah, 1989. Proc. Natl Acad. Sci. U.S.A. 86: 

10 6553-6556; Lemaitre, et ah, 1987. Proc. Natl. Acad Sci. 84: 648-652; PCT Publication No. 
WO88/09810) or the blood-brain barrier (see, e.g., PCT Publication No. WO 89/10134).' In 
addition, oligonucleotides can be modified with hybridization triggered cleavage agents {see, 
e.g., Krol, et ah, 1988. BioTechniq,ues 6:958-976) or intercalating agents (see, e.g., Zon, 1988. 
Pharm. Res. 5: 539-549). To this end, the oligonucleotide may be conjugated to another 

15 molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a 

i 

hybridization-triggered cleavage agent, and the like. 

NOVX Polypeptides 

A polypeptide according to the invention includes a polypeptide including the amino 
• acid sequence of NOVX polypeptides whose sequences are provided in SEQ ID NOS:2, 4, 6, 

20 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. The 

invention also includes a mutant or variant protein any of whose residues may be changed 
from the corresponding residues shown in SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50 while still encoding a protein that 
maintains its NOVX activities and physiological functions, or a functional fragment thereof 

25 In general, an NOVX variant that preserves NOVX-like function includes any variant 

in which residues at a particular position in the sequence have been substituted by other amino 
acids, and further include the possibility of inserting an additional residue or residues between 
two residues of the parent protein as well as the possibility of deleting one or more residues 
from the parent sequence. Any amino acid substitution, insertion, or deletion is encompassed 

30 by the invention. In favorable circumstances, the substitution is a conservative substitution as 
defined above. 

One aspect of the invention pertains to isolated NOVX proteins, and biologically- 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also provided 
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are polypeptide fragments suitable for use as immunogens to raise anti-NOVX antibodies. In 
" one embodiment, native NOVX proteins can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification. techniques. In another 
embodiment, NOVX proteins are produced by recombinant DNA techniques. Alternative to 
5 recombinant expression, an NOVX protein or polypeptide can be synthesized chemically 
using standard peptide synthesis techniques,. 

An "isolated" or "purified" polypeptide or protein or biologically*- active portion thereof 
is substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the NOVX prbtein is derived, or substantially free from chemical 

10 precursors or other chemicals when. chemically synthesized. The language "substantially free 
of cellular material" includes preparations of NOVX proteins in which the protein is separated 
•from cellular components of the cells from which it is isolated or recombinantly-produced. In 
one embodiment, the language "substantially free of cellular material" includes preparations of 
NOVX proteins having less than about 30% (by dry weight) of non-NOVX proteins (also 

15 referred to herein as a "contaminating protein"), more preferably less than about 20% of 

non-NOVX proteins, still more preferably less than about 10% of non-NOVX proteins, and 
most preferably less than about 5% of non-NOVX proteins. When the NOVX protein or 
biologically-active portion thereof is recombinantly-produced, it is also preferably 
substantially free of culture medium, i.e., culture medium represents less than about 20%, 

20 more preferably less than about 1 0%, and most preferably less than about 5% of the volume of 
the NOVX protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of NOVX proteins in which the protein is separated from chemical precursors or 
other chemicals that are involved in the synthesis of the protein. In one embodiment, the 

25 language "substantially free of chemical precursors or other chemicals" includes preparations 
of NOVX proteins having less than about 30% (by dry weight) of chemical precursors or 
non-NOVX chemicals, more preferably less than about 20% chemical precursors or 
non-NOVX chemicals, still more preferably less than about 10% chemical precursors or 
rion-NOVX chemicals, and most preferably less than about 5% chemical precursors or 

30 non-NOVX chemicals. 

Biologically-active portions of NOVX proteins include peptides comprising amino 
acid sequences sufficiently homologous to or derived from the amino acid sequences of the 
NOVX proteins {e.g., the amino acid sequence shown in SEQ 3D NOS:2, 4, 6, 8, 10, 12, 14, 
16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50) that include fewer amino 
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acids than the full-length NOVX proteins, and exhibit at least one activity of an NOVX 
protein. Typically, biologically-active portions comprise a domain or motif with at least one 
activity of the NOVX protein. A 1 biologically- active portion of an NOVX protein can be a 
polypeptide which is, for example, 10, 25, 50, 100 or more amino acid residues in length. 
5 Moreover, other biologically-active portions, in which other regions of the protein are 

deleted, can be prepared by recombinant techniques and evaluated for one or more of the 
functional activities of a native NOVX protein. ' 

In an embodiment, the NOVX protein has an amino acid sequence shown SEQ ID 
NOS:2, 4, 6, 8', 10, 12, If 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50. 

1 0 In other embodiments, the NOVX protein is substantially homologous to SEQ ID NOS:2, 4, 6, 
8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50, and retains 
the functional activity of the protein of SEQ ID NOS:2, 4, 6, 8, 10, i2, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50, yet differs in amino acid sequence due to 
natural allelic variation or mutagenesis, as described in detail, below. Accordingly, in another 

15 embodiment, the NOVX protein is a protein that comprises an amino acid sequence at least 

about 45% homologous to the amino acid sequence SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16,. 18, 
20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50, and retains the functional activity 
of the NOVX proteins of SEQ ID NOS:2, 4, 6,8, 10, 12, 14, 16, 18, 20, 22,24,26,28, 30, 32, 
36, 38, 40, 42, 44, 46, 48 and 50. 

20 Determining Homology Between Two or More Sequences 

To determine the percent homology of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced 
in the sequence of a first amino acid or nucleic acid sequence for optimal alignment with a 
second amino or nucleic acid sequence). The amino acid residues or nucleotides at 

25 corresponding amino acid positions or nucleotide positions are then compared. When a 

position in the first sequence is occupied by the same amino acid residue or nucleotide as the 
corresponding position in the second sequence, then the molecules are homologous at that 
position (i.e., as used herein amino acid or nucleic acid "homology" is equivalent to amino 
acid or nucleic acid "identity"). 

30 The nucleic acid sequence homology may be determined as the degree of identity 

between two sequences. The homology may be determined using computer programs known 
in the art, such as GAP software provided in the GCG program package. See, Needleman and 
Wunsch, 1970. JMolBiol 48: 443-453. Using GCG GAP software with the following settings 



BNSDOCID: «WO_02055704A2_I_: 



WO 02/055704 PCT/US02/00554 

for nucleic acid sequence comparison: GAP creation penalty of 5,0 and GAP extension 
'" penalty of 03, the coding region of the analogous nucleic acid sequences referred to above 
exhibits a degree of identity preferably of at least 70%, 75%, 80%, .85%, 90%, 95%, 98%, or 
99%, with the CDS (encoding) part of the DNA sequence shown in SEQ ID NOS:l, 3, 5, 7, 9, 
5 11, 13, 15, 17, 19,21,23,25,27, 29,31,33, 34, 35, 37, 39, 41, 43, 45, 47 and 49. 

The term "sequence identity" refers to the. degree to which two polynucleotide or 
polypeptide sequences are identical on a residue-by-residue basis over a particular region of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over 1 that region of comparison, determining the number of 

10 positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case of 

nucleic acids) occurs in both sequences to yield the number of matched positions, dividing the 
■number of matched positions by the total number of positions in the region of comparison (i.e., 
the window size), and multiplying the result by 100 to yield the percentage of sequence 
identity. The term "substantial identity" as used herein denotes a characteristic of a 

15 polynucleotide sequence, wherein the polynucleotide comprises a sequence that has at least 80 
percent sequence identity, preferably at least 85 percent identity and often 90 to 95 percent 
sequence identity, more usually at least 99 percent sequence identity as compared to a 
reference sequence over a comparison region. 

Chimeric and Fusion Proteins 

20 The invention also provides NOVX chimeric or fusion proteins. As used herein, an 

NOVX "chimeric protein" or "fusion protein" comprises an NOVX polypeptide operatively- 
linked to a non-NOVX polypeptide. An "NOVX polypeptide" refers to a polypeptide having 
an amino acid sequence corresponding to an NOVX protein SEQ ID NOS:2, 4, 6, 8, 10, 12, 
14, 16, 1 8, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50, whereas a "non-NOVX 

25 polypeptide" refers to a polypeptide having an amino acid sequence corresponding to a protein 
that is not substantially homologous to the NOVX protein, e.g., a protein that is different from 
the NOVX protein and that is derived from the same or a different organism. Within an 
NOVX fusion protein the NOVX polypeptide can correspond to all or a portion of an NOVX 
protein. In one embodiment, an NOVX fusion protein comprises at least one biologically- 

30 active portion of an NOVX protein. In another embodiment, an NOVX fusion protein 
comprises at least two biologically-active portions of an NOVX protein. In yet another 
embodiment, an NOVX fusion protein comprises at least three biologically-active portions of 
an NOVX protein. Within the fusion protein, the term "operatively-linked" is intended to 
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indicate that the NOVX polypeptide and the non-NOVX polypeptide are fused in-frame with 
" one another. The non-NOVX polypeptide can be fused to the N-terminus or C-terminus of the 
NOVX polypeptide. ' ' 

In one embodiment, the fusion protein is a GST-NOVX fusion protein in which the 
5 NOVX sequences are fused to the C-terminus of the GST (glutathione S-transferase) 
sequences. Such fusion proteins can facilitate the purification of recombinant NOVX 
polypeptides. - » 1 

In another embodiment, the fusion protein is an NOVX protein containing a 
heterologous signal sequence at its N-terminus. In certain host cells (e.g., mammalian host 
10 cells), expression and/or secretion of NOVX can be increased through use of a heterologous 
signal sequence. 

hi yet another embodiment, the fusion protein is an NQVX-immunoglobulin fusion 
protein in which the NOVX sequences are fused to sequences derived from a member of the 
immunoglobulin protein family. The NOVX-immunoglobulin fusion proteins of the invention 

15 can be incorporated into pharmaceutical compositions and administered to a subject to inhibit 
an interaction between an NOVX ligand and an NOVX protein on the surface of a cell, to 
thereby suppress NOVX-mediated signal transduction in vivo. The NOVX-immuno globulin 
fusion proteins can be used to affect the bioavailability of an NOVX cognate ligand. 
Inhibition of the NOVX ligand/NOVX interaction may be useful therapeutically for both the 

20 treatment of proliferative and differential ve disorders, as well as modulating (e.g. promoting 
or inhibiting) cell survival. Moreover, the NOVX-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NOVX antibodies in a subject, to purify 
NOVX ligands, and in screening assays to identify molecules that inhibit the interaction of 
NOVX with an NOVX ligand. 

25 An NOVX chimeric or fusion protein of the invention can be produced by standard 

recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction 
enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, 

30 alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In 

another embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers that give rise to complementary overhangs between two 
consecutive gene fragments that can subsequently be annealed and reamplified to generate a 
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chimeric gene sequence (see, e.g., Ausubel, et ah (eds.) CURRENT PROTOCOLS IN MOLECULAR 
Biology, John Wiley & Sons, 1992). Moreover, many expression vectors, are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). An NOVX-encoding 
nucleic acid can be cloned into such an expression vector such that the fusion moiety is linked 
5 in- frame to the NOVX protein. 

NOVX Agonists and Antagonists 

t 

The invention also pertains to variants of the NOVX proteins that function as either 
NOVX agonists (i.e., mimetics) or as NOVX antagonists. Variants of the NOVX protein can 
be generated by mutagenesis (e.g., discrete point mutation or truncation of the NOVX protein). 

1 0 An agonist 'of the NOVX protein can retain substantially the same, or a subset of, the 

biological activities of the naturally occurring form of the NOVX protein. An antagonist of 
the NOVX protein can inhibit one or more of the activities of the naturally occurring form of 
the NOVX protein by, for example, competitively binding to a downstream or upstream 
member of a cellular signaling cascade which includes the NOVX protein. Thus, specific 

15 biological effects can be elicited by treatment with a variant of limited function. In one 

embodiment, treatment of a subject with a variant having a subset of the biological activities 
of the naturally occurring form of the protein has fewer side effects in a subject relative to 
treatment with the naturally occurring form of the NOVX proteins. 

Variants of the NOVX proteins that function as either NOVX agonists (i.e., mimetics) 

20 or as NOVX antagonists can be identified by screening combinatorial libraries of mutants 
(e.g., truncation mutants) of the NOVX proteins for NOVX protein agonist or antagonist 
activity. In one embodiment, a variegated library of NOVX variants is generated by 
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 
library. A variegated library of NOVX variants can be produced by, for example, 

25 enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences such that a 
degenerate set of potential NOVX sequences is expressible as individual polypeptides, or 
alternatively, as a set of larger fusion proteins (e.g., for phage display) containing the set of 
NOVX sequences therein. There are a variety of methods which can be used to produce 
libraries of potential NOVX variants from a degenerate oligonucleotide sequence. Chemical 

30 synthesis of a degenerate gene sequence can be performed in an automatic DNA synthesizer, 
and the synthetic gene then ligated into an appropriate expression vector. Use of a degenerate 
set of genes allows for the provision, in one mixture, of all of the sequences encoding the 
desired set of potential NOVX sequences. Methods for synthesizing degenerate 
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oligonucleotides are well-known within the art. See, e.g., Narang, 1983. Tetrahedron 39: 3; 
Itakura, et ah, 1984. Annu. Rev. Biochem. 53: ^23; Itakura, et ah, 1984. Science 198: 1056; 
Ike, et aL, 1983. Nucl Acids Res\ 1 1 : 477. 

Polypeptide Libraries 

In addition, libraries of fragments of the NOVX protein coding sequences can be used 

i 

to generate a variegated population of NOVX fragments for screening and subsequent , 

i 

selection of variants of an NOVX protein. In one embodiment, a library of coding sequence 
fragments can, be generated by treating a double stranded PCR fragment of an NOVX coding 
sequence with a nuclease under conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing the DNA to form double-strarided 
DNA that can include sense/anti sense pairs from different nicked prpducts, removing single 
stranded portions from reformed duplexes by treatment with Si nuclease, and ligating the 
resulting fragment library into an expression vector. By this method, expression libraries can 
be derived which 'encodes N-terminal and internal fragments of various sizes of the NOVX 
proteins. 

Various techniques are known in the art for screening gene products of combinatorial 
libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis of NOVX proteins. The most 
widely used techniques, which are amenable to high throughput analysis, for screening large 
gene libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose,product was detected. Recursive ensemble 
mutagenesis (REM), a new technique that enhances the frequency of functional mutants in the 
libraries, can be used in combination with the screening assays to identify NOVX variants. 
See, e.g., Arkin and Yourvan, 1992. Proc. Natl Acad. Set USA 89: 7811-7815; Delgrave, et 
aL, 1993. Protein Engineering 6:327-331. 

Antibodies 

The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that 
contain an antigen binding site that specifically binds (immunoreacts with) an antigen. Such 
antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F ab , 
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Fab' and F (ab .) 2 fragments, and an F ab expression library. In general, antibody molecules 
- obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ , 
from one another by the nature of the heavy chain present in the molecule. Certain classes 
have subclasses as well, such as IgGi, IgG2, and others. Furthermore, in humans, the light 
5 chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a 
reference to all such classes, subclasses and types of human antibody species. 

An isolated protein of the invention intended to serve as an antigen, or a portion or 
fragment thereof, can be used as an immunogen to generate antibodies that 
immunospecifically bind the antigen, using standard techniques for polyclonal and monoclonal 

10 anybody preparation. The full-length protein can be used or, alternatively, the invention 1 
provides antigenic peptide fragments of the antigen for use as immunogens. An antigenic 
, peptide fragment comprises at least 6 amino acid residues of the amino acid sequence of the 
full length protein, such as an amino acid sequence shown in SEQ JD NOS:2, 4, 6, 8, 10, 12, 
14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and 50, and encompasses an 

1 5 epitope thereof such that an antibody raised against the peptide forms a specific immune 
complex with the full length protein or with any fragment that contains the epitope. 
Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 
amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

20 located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of SECX that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human SECX protein sequence will 
indicate which regions of a SECX polypeptide are particularly hydrophilic and, therefore, are 

25 likely to encode surface residues useful for targeting antibody production. As a means for 
targeting antibody production, hydropathy plots showing regions of hydrophilicity and 
hydrophobicity may be generated by any method well known in the art, including, for 
example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier 
transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; 

30 Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each incorporated herein by reference in 
their entirety. Antibodies that are specific for one or more domains within an antigenic protein, 
or derivatives, fragments, analogs or homologs thereof, are also provided herein. 
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A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal 
5 or monoclonal antibodies directed against a protein of the invention, or against derivatives, 

fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
Manual, Harlow E, and Lane D, 1 988, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

Polyclonal Antibodies 

10 For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 
protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 

1 5 recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated 
to a second protein known to be immunogenic in the mammal being immunized. Examples of 
such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response include, 

20 but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum 
hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille 
Calmette-Guerin and Corynebacterium parvum, or similar immuno stimulatory agents. 
Additional examples of adjuvants which can be employed include MPL-TDM adjuvant 

25 (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate). 

The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known 
techniques, such as affinity chromatography using protein A or protein G, which provide 
primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific 

30 antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be 
immobilized on a column to purify the immune specific antibody by immunoaffmity 
chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson 
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(The Scientist, published by The Scientist, hie, Philadelphia PA, Vol. 14, No. 8 (April. 17, 

2000), pp. 25-28). " "' " 1 ' 

i , v 

Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as 
5 used herein, refers to a population of antibody molecules that contain only one molecular 
species of antibody molecule consisting of a unique light chain gene product and a unique 
heavy chain gene product. In particular, the complementarity determining regions (CDRs) of 
the monoclonal antibody are identical in all the molecules of the population. MAbs thus 
contain an antigen binding site capable of immunoreacting with a particular epitope of the 

1 0 antigen characterized by a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, 
hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically 

15 bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or 
a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of 
human origin are desired, or spleen cells or lymph node cells are used if non-human 
mammalian sources are desired. The lymphocytes are then fused with an immortalized cell 

20 line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell 

[Goding, Monoclonal Antibodies: Principles and Practice , Academic Press, (1986) pp. 59- 
103], Immortalized cell lines are usually transformed mammalian cells, particularly myeloma 
cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are 
employed. The hybridoma cells can be cultured in a suitable culture medium that preferably 

25 contains one or more substances that inhibit the growth or survival of the unfused, 

immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine 
phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas 
typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which 
substances prevent the growth of HGPRT- deficient cells. 

30 Preferred immortalized cell lines are those that fuse efficiently, support stable high 

level expression of antibody by the selected antibody-producing cells, and are sensitive to a 
medium such as HAT medium. More preferred immortalized cell lines are murine myeloma 
lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San 
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Diego, California and the American Type Culture Collection, Manassas, Virginia. Human 
myeloma and mouse-human heteromyeloma cell lines also have been described for the 
production of human monoclonal antibodies [Kozbor, J ' Immunol. . 133:3001 (1984); Brodeur 
et al., Monoclonal Antibody Production Techn i ques and Applications , Marcel Dekker, Inc., 
5 New York, (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
irnmunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 

10 enzyme-linked imxminoabsorbent assay (ELISA). Such techniques and assays are known in 
the art. The binding affinity of the monoclonal antibody can, for example, be determined by 
the Scatchard analysis of Munson and Pollard, Anal. Biochereu 107:220 (1980). It is an 
objective, especially important in therapeutic applications of monoclonal antibodies, to 
identify antibodies having a high degree of specificity and a high binding affinity for the target 

.5 antigen. 

After the desired hybridoma cells are identified, the clones can be subcloned by 
limiting dilution procedures and grown by standard methods (Goding,19S6). Suitable culture 
media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI- 
1 640 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a 
10 mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from 
the culture medium or ascites fluid by conventional immunoglobulin purification procedures 
such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel 
electrophoresis, dialysis, or affinity chromatography. 

:5 The monoclonal antibodies can also be made by recombinant DNA methods, such as 

those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of 
the invention can be readily isolated and sequenced using conventional procedures (e.g., by 
using oligonucleotide probes that are capable of binding specifically to genes encoding the 
heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a 

0 preferred source of such DNA. Once isolated, the DNA can be placed into expression vectors, 
which are then transfected into host cells such as simian COS cells, Chinese hamster ovary 
(CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to 
obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also 
can be modified, for example, by substituting the coding sequence for human heavy and light 
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chain constant domains in place of the homologous murine sequences (U.S. Patent No. 
4,81 6,567; Morrison, Nature 368, '812-13 (1994)) or by covalently joining to the 
immunoglobulin coding sequence all or part of the coding sequence for a non-imifrunoglobulin 
polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant 
5 domains of an antibody of the invention, or can be substituted for the variable domains of one 
antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody. 

Humanized Antibodies 1 ; 

The antibodies directed against the protein antigens of the invention can further 
comprise humanized antibodies or human antibodies. These antibodies are suitable for 

1 0 administration to humans without engendering an immune response by the human against the 
administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (stich as Fv, Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 
immunoglobulin; and contain minimal sequence derived from a non-human immunoglobulin. 

15 Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et ah, 
Science , 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 
corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by 

20 corresponding non-human residues. Humanized antibodies can also comprise residues which 
are found neither in the recipient antibody nor in the imported CDR or framework sequences. 
In general, the humanized antibody will comprise substantially all of at least one, and typically 
two, variable domains, in which all or substantially all of the CDR regions correspond to those 
of a non-human immunoglobulin and all or substantially all of the framework regions are 

25 those of a human immunoglobulin consensus sequence. The humanized antibody optimally 
also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that 
of a human immunoglobulin (Jones et ah, 1986; Riechmann et al, 1988; and Presta, Curr. Op. 
Struct. Biol. , 2:593-596 (1992)). 

Human Antibodies 

30 Fully human antibodies essentially relate to antibody molecules in which the entire 

sequence of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies' 5 herein. 
Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
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hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et al., 1985 In* MONOCLONAL 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human Aionoclona] 
antibodies may be utilized in the practice of the present invention and maybe produced by 
5 using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or 
by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: 
Monoclonal Antibodies and Cancer Therapy, Aian R. Liss, Inc., pp. 77-96). 1 . 

In addition, human antibodies can also be produced using additional techniques, 
including pha^e display libraries (fjoogenboom and Winter, TMol. Biol. , 227:381 (1993); 

10 Marks et al., J. MoT Biol. , 222:581 (1991)). Similarly, human antibodies can be made by 
introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the' 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in 
humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This 

1 5 approach is described, for example, in US, Patent Nos. 5,545,807'; 5,545,806; 5,569,825; 

5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10. 779-783 (3992)); 
Lonberg et al. ( Nature 368 856-859 (1994)); Moni son (Nature 368, 812-13 (1994)); Fishwild 
et al,( Nature Biotechnolog y 14, 845-51 (1996)); Neuberger ( Nature Biotechnology 14, 826 
(1 996)); and Lonberg and Huszar ( Intern Rev. Immunol. 13 65-93 (1995)). 

20 Human antibodies may additionally be produced using transgenic nonhuman animals 

which are modified so as to produce fully human antibodies rather than the animal's 
endogenous antibodies in response to challenge by an antigen. (See PCT publication 
WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains in 
the nonhuman host have been incapacitated, and active loci encoding human heavy and light 

25 chain immunoglobulins are inserted into the host's genome. The human genes are 

incorporated, for example, using yeast artificial chromosomes containing the requisite human 
DNA segments. An animal which provides all the desired modifications is then obtained as 
progeny by crossbreeding intermediate transgenic animals containing fewer than the full 
complement of the modifications. The preferred embodiment of such a nonhuman animal is a 

30 mouse, and is termed the Xenomouse™ as disclosed in PCT publications WO 96/33735 and 
WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins. 
The antibodies can be obtained directly from the animal after immunization with an 
imm unogen of interest, as. for example, a preparation of a polyclonal antibody, or alternatively 
from immortalized B cells derived from the animal, such as hybridomas producing 
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monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with human 
" variable regions can be recovered and expressed to obtain the antibodies directly, or can be 
further modified to obtain analogs of antibodies such as, for example, single chain Fv 
molecules. 

5 An example of a method of producing a nonhuman host, exemplified as a mouse, 

lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent 
No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at 
least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of 
the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain 
10 locus, the deletion being effected by a targeting vector containing a gene encoding a selectable 
marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and 
i germ cells contain the gene encoding the selectable marker. 

A method for producing an antibody of interest, such as a human antibody, is disclosed 
in U.S. Patent No. 5,91 6,771 . It includes introducing an expression vector that contains a 

15 nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, 

i 

introducing an expression vector containing a nucleotide sequence encoding a light chain into 
another mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell 
expresses an antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically 
20 relevant epitope on an immunogen, and a correlative method for selecting an antibody that 
binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT 
publication WO 99/53049. 

F a b Fragments and Single Chain Antibodies 

According to the invention, techniques can be adapted for the production of 
25 single-chain antibodies specific to an antigenic protein of the invention (see . e.g., U.S. Patent 
No. 4,946,778). In addition, methods can be adapted for the construction of F a b expression 
libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective 
identification of monoclonal F ab fragments with the desired specificity for a protein or 
derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the 
30 idiotypes to a protein antigen may be produced by techniques known in the art including, but 
not limited to: (i) an F( a b>)2 fragment produced by pepsin digestion of an antibody molecule; (ii) 
an Fab fragment generated by reducing the disulfide bridges of an F( ab ')2 fragment; (iii) an F ab 
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fragment generated by the treatment of the antibody molecule with papain and a reducing 
agent and (iv) F v fragments. 

Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
5 have binding specificities for at least two different antigens. In the present case, one of the 

binding specificities is for an antigenic protein of the invention. The second binding target is 

i 

any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two i 

1 0 immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
_ specificities (Milstein and Cuello, Nature , 305:537-539 (1983)). Because of the random 
assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce 
a potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by 

1 5 affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 
13 May 1993, and in Traunecker et aL, EMBOJ., 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 
preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part 

20 of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant 

region (CHI) containing the site necessary for light-chain binding present in at least one of the 
fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the 
immunoglobulin \ight chain, are inserted into separate expression vectors, and are co- 
transfected into a suitable host organism. For further details of generating bispecific 

25 antibodies see, for example, Suresh et al., Methods in Enzvmology , 121 :210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 
of antibody molecules can be engineered to maximize the percentage of heterodimers which 
are recovered from recombinant cell culture. The preferred interface comprises at least a part 
of the CH3 region of an antibody constant domain. In this method, one or more small amino 

30 acid side chains from the interface of the first antibody molecule are replaced with larger side 
chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the 
large side chain(s) are created on the interface of the second antibody molecule by replacing 
large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a 
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mechanism for increasing the yield of the heterodimer over other unwanted end-products such 
as homodimers. " ' 

Bispecific antibodies can 1 be prepared as full length antibodies or antibody fragments 
(e.g. F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from 
5 antibody fragments have been described in the literature. For example, bispecific antibodies 
can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a 
procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. 
These fragments are, reduced in the presence of the dithiol complexing agent sodium arsenite 

to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' 

i 1 

10 fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the 
Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with 
mercaptoethylamine and is mixed with an equimolar amount of the bther Fab'-TNB derivative 
to form the bispecific antibody. The bispecific antibodies produced can be used as agents for 
the selective immobilization of enzymes. 1 

15 Additionally, Fab' fragments can be directly recovered from E. coli and chemically 

coupled to form bispecific antibodies. Shalaby et al., J. . Exp. Med. 175:217-225 (1992) 
describe the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab 5 
fragment was separately secreted from E. coli and subjected, to directed chemical coupling in 
vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to 

20 cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast tumor targets. 

Various techniques for making and isolating bispecific antibody fragments directly 
from recombinant cell culture have also been described. For example, bispecific antibodies 
have been produced using leucine zippers. Kostelny et al. } J. Immunol. 148(5):1547-1553 

25 (1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' 

portions of two different antibodies by gene fusion. The antibody homodimers were reduced 
at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. 
This method can also be utilized for the production of antibody homodimers. The "diabody" 
technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has 

30 provided an alternative mechanism for making bispecific antibody fragments. The fragments 
comprise a heavy-chain variable domain (V H ) connected to a light-chain variable domain (Vl) 
by a linker which is too short to allow pairing between the two domains on the same chain. 
Accordingly, the V H and V L domains of one fragment are forced to pair with the 
complementary V L and V H domains of another fragment, thereby forming two antigen-binding 
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sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv 
(sFv) dimers has also been reported. See, Gruber et al, L Immunol, 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For exafnple, trispeciflc 
antibodies can be prepared. Tutt et al., J, Immunol. 147:60 (1991). 
) 5 Exemplary bispecific antibodies can bind to two different epitopes, at least one of 

which originates in the protein antigen of the invention. Alternatively, an anti-antigenic ann 
of an immunoglobulin molecule can be cofnbined with 'an arm which binds to a triggering, 
molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or 
Fc receptors for IgG (FeyR), such ^s FcyRI (CD64), FcyRII (CD32) and FcyRIII (CD16) so as 
10 to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific 
antibodies can also be used to direct cytotoxic agents to cells which express a particular ' 
antigen. These antibodies possess an antigen-binding ann and an arm which binds a cytotoxic 
agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another 
bispecific antibody of interest binds the protein antigen described herein and further binds 
15 tissue factor (TF). 

Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
3 antibodies have, for example, been proposed to target immune system cells to unwanted cells 

20 ' (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 

92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using 
known methods in synthetic protein chemistry, including those involving crosslinking agents. 
For example, immunotoxins can be constructed using a disulfide exchange reaction or by 
forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate 

25 and methyl-4-mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 
4,676,980. 

Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector 
function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For 
30 example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain 
disulfide bond formation in this region. The homodimeric antibody thus generated can have 
improved internalization capability and/or increased complement-mediated cell killing and 
antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med .. 176: 1191- 
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1 195 (1992) and Shopes, J. Immunol ., 148: 2918-2922 (1992). Homodimeric antibodies with 
'" enhanced anti-tumor activity can also be prepared using heterobiftmctional cross-linkers as 
described in Wolff et al. Cancer Research , 53: 2560-2565 (1993). Alternatively, an antibody 
can be engineered that has dual Fc regions and can thereby have enhanced complement lysis 
5 and ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

Immunoconjugates 

The invention also pertains to immunoconju gates comprising an antibody conjugated 
to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active 
t,oxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive , 

1 0 isotope (i.e., a radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugates have 
been described above. Enzymatically active toxins and fragments thereof that can be used 
include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain 
(from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 

1 5 Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples 
include 212 Bi, 131 1, 131 In, 90 Y, and ,86 Re. 

20 Conjugates of the antibody and cytotoxic agent are made using a variety of 

bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate 
(SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl 
adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as 
glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis- 

25 diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine)j diisocyanates 
(such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1,5-difluoro- 
2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in 
Vitetta et al., Science , 238: 3 098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3- 
methyldi ethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent for 

30 conjugation of radionucleotide to the antibody. See W094/1 1 026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
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using a clearing agent and then administration of a "ligand" (e.g., avi din) that is in turn 
conjugated to a cytotoxic agent. 

Immunoliposomes 

The antibodies disclosed herein can also be formulated as immunoliposomes. 
Liposomes containing the antibody are prepared by methods known in the art, such as 
described in Epstein et ai. s Proc. Natl. Acad. Sci. USA , 82: 3688 (1985); Hwang et al., Proc. 
Natl Acad. Sci. USA . 77: 4030 (1980); and U.S. Pat. Nos. 4,485,045 and 4,544,545. 
Liposomes with,enhanced circulation time are disclosed in U.S. Patent No. 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation 
method with a* lipid composition comprising phosphatidylcholine, cholesterol, and PEG- 
derivatized phosphatidylethanolamine (PEG-PE). Liposomes are extruded through filters of 
defined pore size to yield liposomes with the desired diameter. Fab' fragments of the antibody 
of the present invention can be conjugated to the liposomes as described in Martin et al 
Biol. Chem. , 257: 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic 
agent (such as Doxorubicin) is optionally contained within the liposome. See Gabizon et al., J. 
National Cancer Inst. , 81(19): 1484 (1989). 

Diagnostic Applications of Antibodies DirectecJ Against the Proteins of the Invention 

Antibodies directed against a protein of the invention may be used in methods known 
within the art relating to the localization and/or quantitation of the protein (e.g., for use in 
measuring levels of the protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, antibodies 
against the proteins, or derivatives, fragments, analogs or homologs thereof, that contain the 
antigen binding domain, are utilized as pharmacologically-active compounds (see below). 

An antibody specific for a protein of the invention can be used to isolate the protein by 
standard techniques, such as immunoaffmity chromatography or immunoprecipitation. Such 
an antibody can facilitate the purification of the natural protein antigen from cells and of 
recombinantly produced antigen expressed in host cells. Moreover, such an antibody can be 
used to detect the antigenic protein (e.g., in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the antigenic protein. Antibodies directed 
against the protein can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given treatment 
regimen. Detection can be facilitated by coupling (i.e., physically linking) the antibody to a 
detectable substance. Examples of detectable substances include various enzymes, prosthetic 
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groups, fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent 
5 materials include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinyl amine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 125 I, 
,3, I, 35 Sor 3 H. 

) I 

1 0 Antibody Therapeutics 

, , Antibodies of the invention, including polyclonal, monoclonal, humanized and fully 
human antibodies, may used as therapeutic agents. Such agents will generally be employed to 
treat or prevent a disease or pathology in a subject'. An antibody preparation, preferably one 
having high specificity and high affinity for its target antigen, is administered to the subject 

1 5 and will generally have an effect due to its binding with the target. Such an effect may be one 
of two kinds, depending on the specific nature of the interaction between the given antibody 
molecule and the target antigen in question. In the first instance, administration of the 
antibody may abrogate or inhibit the binding of the target with an endogenous ligand to which 
it naturally binds. In this case, the antibody binds to the target and masks a binding site of the 

20 naturally occurring ligand, wherein the ligand serves as an effector molecule. Thus the 
receptor mediates a signal transduction pathway for which ligand is responsible. 

Alternatively, the effect may be one in which the antibody elicits a physiological result 
by virtue of binding to an effector binding site on the target molecule. In this case the target, a 
receptor having an endogenous ligand which may be absent or defective in the disease or 

25 pathology, binds the antibody as a surrogate effector ligand, initiating a receptor-based signal 
transduction event by the receptor. , 

A therapeutically effective amount of an antibody of the invention relates generally to 
the amount needed to achieve a therapeutic objective. As noted above, this may be a binding 
interaction between the antibody and its target antigen that, in certain cases, interferes with the 

30 functioning of the target, and in other cases, promotes a physiological response. The amount 
required to be administered will furthermore depend on the binding affinity of the antibody for 
its specific antigen, and will also depend on the rate at which an administered antibody is 
depleted from the free volume other subject to which it is administered. Common ranges for 
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therapeutically effective dosing of an antibody or antibody fragment of the invention may be, 
by way of nonlimiting example, from about 0.1 mg/kg body weight to about 50 mg/kg body - 
weight. Common dosing frequencies may range, for' example, from twice daily to once a. 
week. 

5 Pharmaceutical Compositions of Antibodies 

Antibodies specifically binding a protein of the invention, as well as other molecules 
identified by the screening assays disclosed herein, can be administered for the treatment of 
various disorders in the form of pharmaceutical compositions. Principles and considerations 
involved in preparing such compositions, as well as guidance in the choice of components are 

10 provided, for example, in Remington : The Science And Practice Of Pharmacy 19th ed. 
(Alfonso R. Gennaro, et ah, editors) Mack Pub. Co., Easton, Pa. : 1995; Drug Absorption 
Enhancement : Concepts, Possibilities, Limitations, And Trends, Harwood Academic 
Publishers, Langhorne, Pa., 1994; and Peptide And Protein Drug Delivery (Advances In 
Parenteral Sciences, Vol. 4), 1991, M. Dekker, New York. 
.: 15 If the antigenic protein is intracellular and whole antibodies are used as inhibitors, 

internalizing antibodies are preferred. However, liposomes can also be used to deliver the 
antibody, or an antibody fragment, into cells. Where antibody fragments are used, the smallest 
inhibitory fragment that specifically binds to the binding domain of the target protein is 
% preferred. For example, based upon the variable-region sequences of an antibody, peptide 

20 molecules can be designed that retain the ability to bind the target protein sequence. Such 
peptides can be synthesized chemically and/or produced by recombinant DNA technology. 
See, e.g., Marasco et al., Proc. Natl. Acad. Sci. USA . 90: 7889-7893 (1993). The formulation 
herein can also contain more than one active compound as necessary for the particular 
indication being treated, preferably those with complementary activities that do not adversely 

25 affect each other. Alternatively, or in addition, the composition can comprise an agent that 
enhances its function, such as, for example, a cytotoxic agent, cytokine, chemotherapeutic 
agent, or growth-inhibitory agent. Such molecules are suitably present in combination in 
amounts that are effective for the purpose intended. 

The active ingredients can also be entrapped in microcapsules prepared, for example, 

30 by coacervation techniques or by interfacial polymerization, for example, 

hydroxymethylcellulose or gelatin-microcapsules and poly-(methylmethacrylate) 
microcapsules, respectively, in colloidal drug delivery systems (for example, liposomes, 
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albumin microspheres, microemulsions, nano-particles, and nanocapsules) or in 
macroemulsions. 

The formulations to be ufeed for in vivo administration must be sterile. This is readily 
accomplished by filtration through sterile filtration membranes. 
5 Sustained-release preparations can be prepared. Suitable examples of sustained- 

release preparations include semipermeable matrices of solid hydrophobic polymers 
containing the antibody, which matrices are in the form of shaped articles, e.g., films, br , 
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (for 
example, poly(2-hydroxyethyl-metbacrylate), or polyvinyl alcohol)), polylactides (U.S. Pat. 

10 No. 3,773,919), copolymers of L- glutamic acid and y ethyl -L-glutamate, non-degradable 

ethylene- vinyl acetate, degradable lactic acid-glycolic acid copolymers such as the LUPRON 
DEPOT ™ (injectable microspheres composed of lactic acid-glycolic acid copolymer and 
leuprolide acetate), and poly-D-(~)-3-hydroxybutyric acid. While polymers such as ethylene- 
vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, 

1 5 certain hydrogels release proteins for shorter time periods. 

ELISA Assay 

An agent for detecting an analyte protein is an antibody capable of binding to an 
analyte protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, 
or more preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., F aD or F( a b)2) 

20 can be used. The term "labeled", with regard to the probe or antibody, is intended to 

encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently-labeled secondary 

25 antibody and end-labeling of a DNA probe with biotin such that it can be detected with 

fluorescently-labeled streptavidin. The term "biological sample" is intended to include tissues, 
cells and biological fluids isolated from a subject, as well as tissues, cells and fluids present 
within a subject. Included within the usage of the term "biological sample", therefore, is 
blood and a fraction or component of blood including blood serum, blood plasma, or lymph. 

30 That is, the detection method of the invention can be used to detect an analyte mRNA, protein, 
or genomic DNA in a biological sample in vitro as well as in vivo. For example, in vitro 
techniques for detection of an analyte mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of an analyte protein include enzyme linked 

188 



BNSDOCID: <WO_ 02055704 A2J_> 



WO 02/055704 PCT/US02/00554 

immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In Vitro tecriniques for detection of an analyte genomic DNA include 
Southern hybridizations. 'Procedures for conducting inxrnunoassays are des£ribedj for example 
in "ELISA: Theory and Practice: Methods in Molecular Biology", Vol. 42, J. R. Crowther 
5 (Ed.) Human Press, Totowa, NJ, 1995; "Immunoassay", E. Diamandis and T. Christopoulus, 
Academic Press, Inc., San Diego, CA, 1996; and "Practice and Thory of Enzyme 
Immunoassays", P. Tijssen, Elsqvier Scierice Publishers, Amsterdam, 1985. Furthermbre^ in 
vivo techniques for .detection of an analyte protein include introducing into a subject a labeled 
anti~an analytb protein antibody. F t or example, the antibody can be labeled with a radioactive 
1 0 marker whose presence and location in a subject can be detected by standard imaging 
techniques. 

NOVX Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression vectors, 
containing a nuclbic acid encoding an NOVX protein, or derivatives, fragments, analogs or 

15 homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable 
of transporting another nucleic acid to which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded. DNA loop into which additional DNA 
segments can be ligated. Another type of vector is a viral vector, wherein additional DNA 
5 segments cm be ligated into the viral genome. Certain vectors are capable of autonomous 

20 replication in a host cell into which they are introduced (e.g., bacterial vectors having a 
bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., 
non-episomal mammalian vectors) are integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are replicated along with the host genome. 
Moreover, certain vectors are capable of directing the expression of genes to which they are 

25 operatively-linked. Such vectors are referred to herein as "expression vectors". In general, 

expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. 
In the present specification, "plasmid" and "vector" can be used interchangeably as the 
plasmid is the most commonly used form of vector. However, the invention is intended to 
include such other forms of expression vectors, such as viral vectors (e.g., replication defective 

30 retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that 
the recombinant expression vectors include one or more regulatory sequences, selected on the 
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basis of the host cells to be used for expression, that is operatively-linked to the nucleic acid 
sequence to be expressed. Within a recombinant expression vector, "operably-linked" is 
intended to mean that the nucleotide sequence of interest is linked to the regulatory 
sequence(s) in a manner that allows for expression of the nucleotide sequence {e.g., in an in 
5 vitro transcription/translation system or in a host cell when the vector is introduced into the 
host cell). . 

The term "regulatory sequence" is intended to includes promoters, enhancers and other 
expression control elements (eg., polyadenylation signals). Such regulatory sequences are 
described, for example, in Goeddel, GENB EXPRESSION TECHNOLOGY: METHODS IN 
10 £nzymology 185, Academic Press, San Diego, Calif. (1990). Regulatory sequences include 
those that direct constitutive expression of a nucleotide sequence in many types of host cell 
'and those that direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
1 5 transformed, the level of expression of protein desired, etc. TJie expression vectors of the 

invention can be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., NOVX 
proteins, mutant forms of NOVX proteins, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
20 NOVX proteins in prokaryotic or eukaryotic cells. For example, NOVX proteins can be 

expressed in bacterial cells such as Escherichia coli, insect cells (using baculovirus expression 
vectors) yeast cells or mammalian cells. Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, Calif. (1990). Alternatively, the recombinant expression vector can be transcribed and 
25 translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most often carried out in Escherichia coli with 
vectors containing constitutive or inducible promoters directing the expression of either fusion 
or non- fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein; usually to the amino terminus of the recombinant protein. Such fusion vectors 
30 typically serve three purposes: (i) to increase expression of recombinant protein; (zi) to 

increase the solubility of the recombinant protein; and (Hi) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression 
vectors, a proteolytic cleavage site is introduced at the junction of the fusion moiety and the 
recombinant protein to enable separation of the recombinant protein from the fusion moiety 
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subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors - 
include pGEX (Pharmacia Biotech Inc; Smith and Johnson, 1988.'. Gene 67: 31-40), pMAL 
(New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, Piscataway, N.J.) that fuse 
5 glutathione S-transferase (GST), maltose E binding protein, or protein A, respectively, to the 
target recombinant protein. 

Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amrann et aL, (1988) Gene 69:301-315) and pET lid (Studier et aL, Gene Expression 
Technology: Methods in EnzJymology 185, Academic Press, San Diego, Calif. (1990) 
10 '60^89). 

One strategy to maximize recombinant protein expression in E. coli is to express the 
1 protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 
protein. See, e.g., Gottesman, Gene Expression Technology: Methods in Enzymology 
185, Academic Press, San Diego, Calif. (1990) 119-128. Another strategy is to alter the 
1 5 nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in E. coli {see, e.g., 
Wada, et aL, 1992. Nucl Acids Res. 20: 21 1 1-21 1 8). Such alteration of nucleic acid 
sequences of the invention can be carried out by standard DNA synthesis techniques. 

In another embodiment, the NOVX expression vector is a yeast expression vector. 
20 Examples of vectors for expression in yeast Saccharomyces cerivisae include pYepSecl 

(Baldari, et aL, 1987. EMBOJ. 6: 229-234), pMFa (Kurjan and Herskowitz, 1982. Cell 30: 
933-943), pJRY88 (Schultz et aL, 1987. Gene 54: 1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, Calif), and picZ (InVitrogen Corp, San Diego, Calif). 

Alternatively, NOVX can be expressed in insect cells using baculovirus expression 
25 vectors. Baculovirus vectors available for expression of proteins in cultured insect cells {e.g., 
SF9 cells) include the pAc series (Smith, et aL, 1983. Mol Cell. Biol. 3: 2156-2165) and the 
pVL series (Lucklow and Summers, 1989. Virology 170: 31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors 
30 include pCDMS (Seed, 1987. Nature 329: 840) and pMT2PC (Kaufman, et aL, 1987. EMBO 
J. 6: 1 87-195). When used in mammalian cells, the expression vector's control functions are 
often provided by viral regulatory elements. For example, commonly used promoters are 
derived from polyoma, adenovirus 2, cytomegalovirus, and simian virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic cells see, e.g., Chapters 16 and 17 of 
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Sambrook, et aL, Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring liarbor Laboratory Press, Cold Spring Harbor, N.Y., 1989. 

In another embodiment, die recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (e.g., 
5 tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tisspe-specific 
promoters include the albumin promoter (liver-specific; Pinkert, et aL, 1987. Genes Dev. 1 : 
268-277), lymphoid-specific promoters (Calame and Eaton, 1988. Adv. Immunol 43: 
235-275), in particular promoters of T cell receptors (Winoto and Baltimore, 1989. EMBO J. 

10 8: 729-733) and immunoglobulins (Banerji, et aL, 1983. Cell 33: 729-740; Queen and 

Baltimore, 1983. Cell 33: 741-748), neuron-specific promoters {e.g., the neurofilament ' 
promoter; Byrne and Ruddle, 1989. Proa Natl. Acad. ScL USA 86: 5473-5477), 
pancreas-specific promoters (Edlund, et aL, 1985. Science 230: 912-916), and mammary 
gland-specific promoters {e.g., milk whey promoter; U.S. Pat. No'. 4,873,316 and European 

15 Application Publication No. 264,166). Deyelopmentally-regulated promoters are also 

encompassed, e.g., the murine hox promoters (Kessel and Gruss, 1990. Science 249: 374-379) 
and the a-fetoprotein promoter (Campes and Tilghman, 1989. Genes Dev. 3: 537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That 

20 is, the DNA molecule is operatively-linked to a regulatory sequence in a manner that allows 

for expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 
NOVX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the 
antisense orientation can be chosen that direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or regulatory 

25 sequences can be chosen that direct constitutive, tissue specific or cell type specific expression 
of antisense RNA. The antisense expression vector can be in the form of a recombinant 
plasmid, phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be detennined by the 
cell type into which the vector is introduced. For a discussion of the regulation of gene 

30 expression using antisense genes see, e.g., Weintraub, et aL, "Antisense RNA as a molecular 
tool for genetic analysis," Reviews-Trends in Genetics, Vol. 1(1) 1986. 

Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms refer 
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not only to the particular subject cell but also to the progeny or potential progeny of such a 

cell Because certain modifications may occur in succeeding generations due to either 

mutation or environmental influences, such progeny may not, ifi fact, be identical to the parent 

cell, but are still included within the scope of the term as used herein. 

5 A host cell can be any prokaryotic or eukaryotic cell. For example, NOVX protein can 

be expressed in bacterial cells such as E. coii 9 insect cells, yeast or mammalian cells (such as 

Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to 

those skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
i i 
10 transformation or transfection techniques. As used herein, the terms "transformation" and 

"transfection" are intended to refer to a variety of art-recognized techniques for introducing 

' foreign nucleic acid {e.g., DNA) into a host cell, including calcium phosphate or calcium 

chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 

electroporation. Suitable methods for transforming or transfecting host cells can be found in 

1 5 Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring 
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), 
and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may integrate 

20 the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
encodes a selectable marker {e.g., resistance to antibiotics) is generally introduced into the 
host cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
selectable marker can be introduced into a host cell on the same vector as that encoding 

25 NOVX or can be introduced on a separate vector. Cells stably transfected with the introduced 
nucleic acid can be identified by drug selection (e.g., cells that have incorporated the 
selectable marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
be used to produce (i.e., express) NOVX protein. Accordingly, the invention further provides 

30 methods for producing NOVX protein using the host cells of the invention. In one 

embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding NOVX protein has been introduced) in a suitable 
medium such that NOVX protein is produced. In another embodiment, the method further 
comprises isolating NOVX protein from the medium or the host cell. 
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Transgenic NOVX Animals 

.» ■ ■ . 

The host cells of the invention can also be used to produce non-human transgenic 

animals. For example, in one embodiment, a Jiost cell of the invention is a fertilized oocyte or 
i 

an embryonic stem cell into which NOVX protein-coding sequences have been introduced. 
5 Such host cells can then be used to create non-human transgenic animals in which exogenous 
NOVX sequences have been introduced into their genome or homologous recombinant 
animals in which endogenous NOVX sequences have been altered. Such animals are useful 
for studying the function and/or activity of NOVX protein and for identifying and/or 
evaluating modulators of NOVX protein activity. As used herein, a "transgenic animal" is a 

1 0 non-human animal, preferably a mammal, more preferably a rodent such as a rat or mouse, in 
which one or more of the cells of the animal includes a transgene. Other examples of 
transgenic animals include non-human primates, sheep, dogs, cows, goats, chickens, 
amphibians, etc. A transgene is exogenous DNA that is integrated into the genome of a cell 
from which a transgenic animal develops and that remains in the genome of the mature 

15 animal, thereby directing the expression of an encoded gene product in one or more cell types 
or tissues of the transgenic animal. As used herein, a "homologous recombinant animal" is a 
non-human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
NOVX gene has been altered by homologous recombination between the endogenous gene 
and an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic cell 

20 of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NOVX-encoding 
nucleic acid into the male pronuclei of a fertilized oocyte (e.g., by microinjection, retroviral 
infection) and allowing the oocyte to develop in a pseudopregnant female foster animal. The 
human NOVX cDNA sequences SEQ IDNOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 

25 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 and 49 can be introduced as a transgene into the 

genome of a non-human animal Alternatively, a non-human homologue of the human NOVX 
gene, such as a mouse NOVX gene, can be isolated based on hybridization to the human 
NOVX cDNA (described further supra) and used as a transgene. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency of 

30 expression of the transgene. A tissue-specific regulatory sequence(s) can be operably-linked 

to the NOVX transgene to direct expression of NOVX protein to particular cells. Methods for 

generating transgenic animals via embryo manipulation and microinjection, particularly 

animals such as mice, have become conventional in the art and are described, for example, in 

U.S. Patent Nos. 4,736,866; 4,870,009; and 4,873,191; and Hogan, 1986. In: MANIPULATING 
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THE MOUSE Embryo, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.' 
Similar methods are used for production of other transgenic animals. A transgenic founder 
animal can be identified based upon the presence of the NOVX transgene in its genome and/or 
expression of NOVX mRNA in tissues or cells of the animals. A transgenic founder animal 
5 can then be used to breed additional animals carrying the transgene. Moreover, transgenic 
animals carrying a transgene-encoding NOVX protein can further be bred to other transgenic 
animals carrying other transgenes. ' ; 

To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of an NOVX gene into which a deletion, addition or substitution has been 
10 introduced to thereby alter, e.g., functionally disrupt, the NOVX gene. The NOVX gene can 
beahumangene(e.^.,thecDNAofSEQIDNOS:l,3 s 5, 7, 9,11, 13, 15, 17, 19,21,23^25, 
27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 arid, 49), but more preferably, is a non-human 
homologue of a human NOVX gene. For example, a mouse homologue of human NOVX 
gene of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17,' 19, 21, 23, 25, 27, 29, 31, 33', 34, 35, 37, 39, 
15 41, 43, 45, 47 and 49 can be used to construct a homologous recombination vector suitable for 
altering an endogenous NOVX gene in the mouse genome, hi one embodiment, the vector is 
designed such that, upon homologous recombination, the endogenous NOVX gene is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a "knock 
out" vector). 

20 Alternatively, the vector can be designed such that, upon homologous recombination, 

the endogenous NOVX gene is mutated or otherwise altered but still encodes functional 
protein (e.g., the upstream regulatory region can be altered to thereby alter the expression of 
the endogenous NOVX protein). In the homologous recombination vector, the altered portion 
of the NOVX gene is flanked at its 5'- and 3 , -termini by additional nucleic acid of the NOVX 

25 gene to allow for homologous recombination to occur between the exogenous NOVX gene 
carried by the vector and an endogenous NOVX gene in an embryonic stem cell. The 
additional flanking NOVX nucleic acid is of sufficient length for successful homologous 
recombination with the endogenous gene. Typically, several kilobases of flanking DNA (both 
at the 5'- and 3 -termini) are included in the vector. See, e.g., Thomas, et ah, 1987. Cell 51: 

30 503 for a description of homologous recombination vectors. The vector is ten introduced into 
an embryonic stem cell line (e.g., by electroporation) and cells in which the introduced NOVX 
gene has homologously-recombined with the endogenous NOVX gene are selected. See, e.g., 
Li, etal, 1992. Ce//69: 915. 
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The selected cells are then injected into a blastocyst of an animal {e.g., a mouse) to 
" form aggregation chimeras. See, e.g., Bradley, 1987. In: TERATOCARCINOMAS AND 

Embryonic Stem Cells: A Practical Approach, Robertson, ed. IRL, Oxford, pp. 1 13-152. 
A chimeric embryo can then be implanted into a suitable pseudopregnant female foster animal 
5 and the embryo brought to term. Progeny harboring the homologously-recombined DNA in 
their genn cells can be used to breed animals in which all cells of the animal contain the 
homologously-recombined DNA by germline transmission of the transgene. Methods for 
constructing homologous recombination vectors and homologous recombinant animals are 
described further in Bradley, 1991. Curr. Opin. Biotechnol 2: 823-829; PCT International 

10 Publication Nos.: WO 90/1 1354; WO 91/01 140; WO 92/0968; and WO 93/04169. 

In another embodiment, transgenic non-humans animals can be produced that contain 
■ selected systems that allow for regulated expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of bacteriophage PI . For a description of the 
cre/loxP recombinase system, See, e.g., Lakso, et al., 1992. Proc. Natl. Acad. Set USA 89: 

1 5 6232-6236. Another example of a recombinase system is the FLP recombinase system of 

Saccharomyces cerevisiae. See, O'Gorman, et ah, 1991. Science 251:1351-1355. Ifacre/loxP 
recombinase system is used to regulate expression of the transgene, animals containing 
transgenes encoding both the Cre recombinase and a selected protein are required. Such 
animals can be provided through the construction of "double" transgenic animals, e.g., by 

20 mating two transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, et al., 1997. Nature 385: 810-813. In brief, a 
cell (e.g., a somatic cell) from the transgenic animal can be isolated and induced to exit the 

25 growth cycle and enter Go phase. The quiescent cell can then be fused, e.g., through the use of 
electrical pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 
morula or blastocyte and then transferred to pseudopregnant female foster animal. The 
offspring borne of this female foster animal will be a clone of the animal from which the cell 

30 {e.g., the somatic cell) is isolated. 

Pharmaceutical Compositions 

The NOVX nucleic acid molecules, NOVX proteins, and anti-NOVX antibodies (also 
referred to herein as "active compounds") of the invention, and derivatives, fragments, analogs 
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and homologs thereof, can be incorporated into pharmaceutical compositions suitable for 
administration. Such compositions typically comprise the nucleic acid molecule, protein, or 
antibody and a pharmaceutical^ acceptable carrier. As used herein, ' "pharmaceutical! y 
acceptable carrier" is intended to include any and all solvents, dispersion media, coatings, 
5 antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Suitable carriers are described in the most 
recent edition of Remington's Pharmaceutical Sciences, a standard reference text in the field, 
which is incorporated herein by reference. Preferred examples of such carriers or diluents 
include, but are not limited to, water, saline, finger's solutions, dextrose solution, and 5% 

10 human serum albumin. Liposomes and non-aqueous vehicles such as fixed oils may also be 
used. The use of such media and agents forpharmaceutically active substances is well known 
in the art. Except insofar as any conventional media or agent is incompatible with the active 
compound, use thereof in the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 

15 A pharmaceutical composition of the invention is formulated to be compatible with its 

intended route of administration. Examples of routes of administration include parenteral, 
e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal (i.e., topical), 
transmucosal, and rectal administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a sterile 

10 diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, glycerine, 
propylene glycol or other synthetic solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating agents such 
as ethylenediaminetetraacetic acid (EDTA); buffers such as acetates, citrates or phosphates, 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. The pH can be 

15 adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable syringes or multiple dose vials made of 
glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions' (where water soluble) or dispersions and sterile powders for the extemporaneous 
\0 preparation of sterile injectable solutions or dispersion. For intravenous administration, 

suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
Parsippany, N J.) or phosphate buffered saline (PBS). In all cases, the composition must be 
sterile and should be fluid to the extent that easy syringeability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
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action of microorganisms such as bacteria and fungi. The carrier can be a solvent or ■ 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by the use of a coating such as lecithin, by 
5 the maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example, paraberis, chlorobutanol, phenol, ascorbic 
acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, poiyalcohols such as manitol, sorbitol, sodium chloride in the 

10 composition. Prolonged absorption of the injectable compositions can be brought about by 
including in the composition an agent which delays absorption, for example, aluminum 
monostearate and gelatin. 1 

Sterile injectable solutions pan be prepared by incorporating the active compound (e.g., 
an NOVX protein or anti-NOVX antibody) in the required amount in an appropriate solvent 

15 with one or a combination of ingredients enumerated above, as required, followed by filtered 
sterilization. Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle that contains a basic dispersion medium and the required other ingredients from 
those enumerated above. In the case of sterile powders for the preparation of sterile injectable 
solutions, methods of preparation are vacuum drying and freeze-drying that yields a powder of 

20 the active ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral therapeutic 
administration, the active compound can be incorporated with excipients and used in the form 

25 of tablets, troches, or capsules. Oral compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid carrier is applied orally and 
swished and expectorated or swallowed. Pharmaceutically compatible binding agents > and/or 
adjuvant materials can be included as part of the composition. The tablets, pills, capsules, 
troches and the like can contain any of the following ingredients, or compounds of a similar 

30 nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient 

such as starch or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, e.g., 
a gas such as carbon dioxide, or 'a nebulizer. ' * 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, detergents, bile salts, and fusidic acid 
derivatives. Transmucosal administration can be accomplished through the use of nasal sprays 
or suppositories. For transdermal administration, the active compounds are formulated into 
ointments, salves, gels, or creams as generally known in the art. 

The compounds can also be prepared in the form of suppositories (e.g, y with 
conventional suppository bases such as cocoa butter and other glycerides) or retention enemas 
for rectal delivery. ( 

In one embodiment, the active compounds are prepared with carriers that will protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as ethylene vinyl acetate, polyanhydrides, 
polyglycolic acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of 
such formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to viral 
antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. 
Patent No. 4,522,811. 

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active compound calculated to produce the 
desired therapeutic effect in association with the required pharmaceutical carrier. The 
specification for the dosage unit forms of the invention are dictated by and directly dependent 
on the unique characteristics of the active compound and the particular therapeutic effect to be 
achieved, and the limitations inherent in the art of compounding such an active compound for 
the treatment of individuals. 
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The nucleic acid molecules of the invention can be inserted into vectors and used as 
" gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example, 
intravenous injection, local administration (see, e.g., U.S. Patent No, 5,328,470) or by 
stereotactic injection (see, e.g., Chen, et al, 1994. Proc. Natl. Acad. Sci. USA 91: 3054-3057). 
5 The pharmaceutical preparation of the gene therapy vector can include the gene therapy vector 
in an acceptable diluent, or can comprise a .slow release matrix in which the gene delivery 
vehicle is imbedded. Alternatively, where the complete gene delivery vector can be produced 
intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical preparation can 
include one or more cells that prbduce the gene delivery system. 
10 ' , The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 

1 Screening and Detection Methods 

The isolated nucleic acid molecules of the invention can be used to express NOVX 
protein (e.g., via a recombinant expression vector in a host cell in gene therapy applications), 

15 to detect NOVX inRNA (e.g., in a biological sample) or a genetic lesion in an NOVX gene, 
and to modulate NOVX activity, as described further, below. In addition, the NOVX proteins 
can be used to screen drugs or compounds that modulate the NOVX protein activity or 
expression as well as to treat disorders characterized by insufficient or excessive production of 
NOVX protein or production of NOVX protein forms that have decreased or aberrant activity • 

20 compared to NOVX wild-type protein (e.g.; diabetes (regulates insulin release); obesity (binds 
and transport lipids); metabolic disturbances associated with obesity, the metabolic syndrome 
X as well as anorexia and wasting disorders associated with chronic diseases and various 
cancers, and infectious disease(possesses anti-mi crobial activity) and the various 
dyslipidemias. In addition, the anti-NOVX antibodies of the invention can be used to detect 

25 and isolate NOVX proteins and modulate NOVX activity. In yet a further aspect, the invention 
can be used in methods to influence appetite, absorption of nutrients and the disposition of 
metabolic substrates in both a positive and negative fashion. 

The invention further pertains to novel agents identified by the screening assays 
described herein and uses thereof for treatments as described, supra. 

30 Screening Assays 

The invention provides a method (also referred to herein as a "screening assay") for 
identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
peptidomimetics, small molecules or other drugs) that bind to NOVX proteins or have a 
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stimulatory or inhibitory effect on, e.g., NOVX protein expression or NOVX protein activity. 
The invention also includes compounds identified in the screening assays described herein. 

In one embodiment, the invention provides assays for screening candidate" or test 
compounds which bind to or modulate the activity of the membrane-bound form of an NOVX 
5 protein or polypeptide or biologically-active portion thereof. The test compounds of the 
invention can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel solid 
phase or solution phase libraries; synthetic library methods requiring deconvolution; the 
"one-bead one-compound" library method; and synthetic library methods using affinity 
10 chromatography selection. The biological library approach is limited to peptide libraries, 
while the other four approaches are applicable to peptide, non-pep tide oligomer or small 
molecule libraries of compounds. See, e.g., Lam, 1997, Anticancer Drug Design 12: 145. 

A "small molecule" as usqd. herein, is meant to refer to a composition that has a 
molecular weight of less than about 5 kD and most preferably less than about '4 kD. Small 
1 5 molecules can be, e.g., nucleic acids, peptides, polypeptides, peptidomimetics, carbohydrates, 
lipids or other organic or inorganic molecules. Libraries of chemical and/or biological 
mixtures, such as fungal, bacterial, or algal extracts, are known in the art and can be screened 
with any of the assays of the invention. 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
20 for example in: DeWitt, et al, 1993. Proc. Natl. Acad. Sci. U.S.A. 90: 6909; Erb, et al., 1994. 
Proc. Natl. Acad. Sci. U.S.A. 91 : 1 1422; Zuckermann, et al, 1994. J. Med. Chem. 37: 2678; 
Gho, et al, 1993. Science 261: 1303; Canrell, et al, 1994. Angew. Chem. Int. Ed. Engl 33: 
2059; Carell, et al., 1994. Angew. Chem. Int. Ed. Engl 33: 2061; and Gallop, et al, 1994. J. 
Med. Chem. 37: 1233. 

25 Libraries of compounds may be presented in solution (e.g., Houghten, 1 992. 

Biotechniques 13: 412-421), or on beads (Lam, 1991. Nature 354: 82-84), on chips (Fodor, 
1993. Nature 364: 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), spores (Ladner, 
U.S. Patent 5,233,409), plasmids (Cull, et al, 1992. Proc. Natl. Acad. Sci. USA 89: 
1865-1869) or on phage (Scott and Smith, 1990. Science 249: 386-390; Devlin, 1990. Science 

30 249: 404-406; Cwirla, et al, 1990. Proc. Natl. Acad. Sci. U.S.A. 87: 6378-6382; Felici, 1991. 
J. Mol Biol. 222: 301-310; Ladner, U.S. Patent No. 5,233,409.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
membrane-bound form of NOVX protein, or a biologically-active portion thereof, on the cell 
surface is contacted with a test compound and the ability of the test compound to bind to an 
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NOVX protein determined. The cell, for example, can of mammalian origin or a. yeast cell. 
Determining the ability of the test compound to bind to the NOVX protein can be 
accomplished, for example, by coupling the test compound with a radioisotope or enzymatic 
label such that binding of the test compound to the NOVX protein or biologically-active 
5 portion thereof can be determined by detecting the labeled compound in a complex. For 

example, test compounds can be labeled with 125 I, 35 S, 14 C, or 3 H, either directly or indirectly, 
and the radioisotope detected by direct counting of iradioemission or by scintillation counting. 
Alternatively, test compounds can be enzymatically-labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 

10 determination of conversion of an appropriate substrate to product. In one embodiment, the 

assay comprises contacting a cell which expresses a membrane-bound form of NOVX protein, 
' or a biologically- active portion thereof, on the cell surface with a known compound which 
binds NOVX to form an assay mixture, contacting the assay mixture with a test compound, 
and determining the ability of the test compound to interact with an NOVX protein, wherein 

1 5 determining the ability of the test compound to interact with t an NOVX protein comprises 
determining the ability of the test compound to preferentially bind to NOVX protein or a 
biologically-active portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell-based assay comprising contacting a cell 
expressing a membrane-bound fonn of NOVX protein, or a biologically-active portion thereof, 

20 on the cell surface with a test compound and determining the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the NOVX protein or biologically- active 
portion thereof. Determiiiing the ability of the test compound to modulate the activity of 
NOVX or a biologically-active portion thereof can be accomplished, for example, by 
determining the ability of the NOVX protein to bind to or interact with an NOVX target 

25 molecule. As used herein, a "target molecule" is a molecule with which an NOVX protein 
binds or interacts in nature, for example, a molecule on the surface of a cell which expresses 
an NOVX interacting protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane or a 
cytoplasmic molecule. An NOVX target molecule can be a non-NOVX molecule or an 

30 NOVX protein or polypeptide of the invention. In one embodiment, an NOVX target 

molecule is a component of a signal transduction pathway that facilitates transduction of an 
extracellular signal (e.g. a signal generated by binding of a compound to a membrane-bound 
NOVX molecule) through the cell membrane and into the cell. The target, for example, can be 
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a second intercellular protein that has catalytic activity or a protein that facilitates the 
association of downstream signaling molecules with NOVX 

Detennining the ability of the NOVX protein to bind tb orinteract with an NOVX 
target molecule can be accomplished by one of the methods described above for determining 
5 direct binding. In one embodiment, determining the ability of the NOVX protein to bind to or 
interact with an NOVX target molecule can be accomplished by determining the activity of the 
target molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target (i.e. intracellular Ca 2+ , 
diacylglycerol, IP3, etc.), detecting catalytic/enzymatic activity of the target an appropriate 
1 0 substrate, detecting the induction of a reporter gene (comprising an NOVX-responsive 

regulatory element operatively linked to a nucleic acid encoding a detectable marker, e.g., 
• luciferase), or detecting a cellular response, for example, cell survival, cellular differentiation, 
or cell proliferation. 

In yet another embodiment, an assay of the invention is a cell-free assay comprising 

1 5 contacting an NOVX protein or biologically-active portion thereof with a test compound and 
determining the ability of the test compound to bind to the NOVX protein or biologically- 
active portion thereof. Binding of the test compound to the NOVX protein can be determined 
either directly or indirectly as described above. In one such embodiment, the assay comprises 
contacting the NOVX protein or biologically-active portion thereof with a known compound 

20 which binds NOVX to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with an NOVX 
protein, wherein determining the ability of the test compound to interact with an NOVX 
protein comprises determining the ability of the test compound to preferentially bind to NOVX 
or biologically-active portion thereof as compared to the known compound. 

25 In still another embodiment, an assay is a cell-free assay comprising contacting NOVX 

protein or biologically-active portion thereof with a test compound and determining the ability 
of the test compound to modulate (e.g. stimulate or inhibit) the activity of the NOVX protein 
or biologically-active portion thereof. Determining the ability of the test compound to 
modulate the activity of NOVX can be accomplished, for example, by detennining the ability 

30 of the NOVX protein to bind to an NOVX target molecule by one of the methods described 

above for determining direct binding. In an alternative embodiment, determining the ability of 
the test compound to modulate the activity of NOVX protein can be accomplished by 
determining the ability of the NOVX protein further modulate an NOVX target molecule. For 
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example, the catalytic/enzymatic activity of the target molecule on an appropriate substrate 
can be determined as de£6ribed, supra. 

In yet another' embodiment, the cell-free assay comprises contacting the NOVX protein 
or biologically- active portion thereof with a known compound which binds NOVX protein to 
5 form an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with an NOVX protein, wherein determining the 
ability of the test compound to interact with an NOV^ protein comprises determining the , 
ability of the NOVX protein to preferentially bind to or modulate the activity of an NOVX 
target molecule. 

1 0 The cell-free assays of the invention are amenable to use of both the soluble form or 

the membrane-bound form of NOVX protein. In the case of cell-free assays comprising the 
membrane-bound fonn of NOVX protein, it may be desirable to utilize a solubilizing agent 
such that the membrane-bound form of NOVX protein is maintained in solution. Examples of 
such solubilizing agents include non-ionic detergents such as n-octylglucoside, 

1 5 n-dodecylglucoside, n-dodecylmaltoside, octanoyl~N-methylglucamide, 
decanoyl-N-methylglucamide, Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly(ethylene glycol ether) n , N-dodecyl— N,N-dimethyl-3-ammonio-l -propane 
sulfonate, 3-(3-cholamidopropyl) dim ethyl amrniniol-1 -propane sulfonate (CHAPS), or 
3-(3-cholamidopropyl)dimethylamminiol~2-hydroxy-l -propane sulfonate (CHAPSO). 

20 In more than one embodiment of the above assay methods of the invention, it may be 

desirable to immobilize either NOVX protein or its target molecule to facilitate separation of 
complex ed from uncomplexed forms of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to NOVX protein, or interaction of 
NOVX protein with a taz-get molecule in the presence and absence of a candidate compound, 

25 can be accomplished in any vessel suitable for containing the reactants. Examples of such 

vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one embodiment, a 
fusion protein can be provided that adds a domain that allows one or both of the proteins to be 
bound to a matrix. For example, GST-NOVX fusion proteins or GST-target fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 

30 glutathione derivatized microtiter plates, that are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or NOVX protein, and the mixture is 
incubated under conditions conducive to complex formation (e.g., at physiological conditions 
for salt and pH). Following incubation, the beads or microtiter plate wells are washed to 
remove any unbound components, the matrix immobilized in the case of beads, complex 
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determined either directly or indirectly, for example, as described, supra. Alternatively, the 
complexes can be dissociated from the matrix,' and the level of NOVX protein binding or 
activity determined using' standard techniques. : ' p 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either the NOVX protein or its target 
molecule can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 
NOVX protein or target molecules can be prepared from biotin-NHS ■ ' ' , 

(N-hydroxy-succinimide) using techniques well-known within the art (e.g., biotinylation kit, 
Pierce Chemicals, Rockford, 111.), arid immobilized in the wells of streptavi din-coated 96 we]] 
plates (Pierce Chemical). Alternatively, antibodies reactive with NOVX protein or target 
molecules, but which do not interfere with binding of the NOVX protein to its target molecule, 
can be derivatized to the wells of the plate, and unbound target or NOVX protein trapped in 
the wells by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes 
using antibodies reactive with the NOVX protein or target molecule, as well as enzyme-linked 
assays that rely on detecting an enzymatic activity associated with the NOVX protein or target 
molecule. 

In another embodiment, modulators of NOVX protein expression are 1 identified in a 
method wherein a cell is contacted with a candidate compound and the expression of NOVX 
mRNA or protein in the cell is detennined. The level of expression of NOVX mRNA or 
protein in the presence of the candidate compound is compared to the level of expression of 
NOVX mRNA or protein in the absence of the candidate compound. The candidate 
compound can then be identified as a modulator of NOVX mRNA or protein expression based 
upon this comparison. For example, when expression of NOVX mRNA or protein is greater 
(/.£., statistically significantly greater) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of NOVX mRNA or protein 
expression. Alternatively, when expression of NOVX mRNA or protein is less (statistically 
significantly less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of NOVX mRNA or protein expression. The level of 
NOVX mRNA or protein expression in the cells can be detennined by methods described 
herein for detecting NOVX mRNA or protein. 

In yet another aspect of the invention, the NOVX proteins can be used as "bait 
proteins" in a two-hybrid assay or three hybrid assay {see, e.g., U.S. Patent No. 5,283,317; 
Zervos, et aL, 1993. Cellll: 223-232; Madura, et at., 1993. J. Biol Chem. 268: 12046-12054; 
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Bartel, et al, 1993. Biotechniques 14: 920-924; Iwabuchi, et aL, 1993. Oncogene 8: 
1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact with ' 
NOVX ("NOVX-binding proteins" or "NOVX~bp") and modulate NOVX activity. Such- 
NOVX-binding proteins are also likely to be involved in the propagation of signals by the 
5 NOVX proteins as, for example, upstream or downstream elements of the NOVX pathway. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domaifis. Briefly, the assay utilizes 

two different DNA constructs. In one construct, the gene that codes for NOVX is fused to a 

t 

gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In the 
1 0 other construct, a DNA sequence, from a library of DNA sequences, that encodes an 

unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation 
1 domain of the known transcription factor. If the "bait" and the "prey" proteins are able to 
interact, in vivo, forming an NOVX-dependent complex, the DNA-binding and activation 
domains of the transcription factor are brought into close proximity. This proximity allows 
1 5 transcription of a reporter gene (e.g., LacZ) that is operably linked to a transcriptional 

regulatory site responsive to the transcription factor. Expression of the reporter gene can be 
detected and cell colonies containing the functional transcription factor can be isolated and 
used to obtain the cloned gene that encodes the protein which interacts with NOVX. 

The invention further pertains to novel agents identified by the aforementioned 
20 screening assays and uses thereof for treatments as described herein. 

Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the corresponding 
complete gene sequences) can be used in numerous ways as polynucleotide reagents. By way 
of example, and not of limitation, these sequences can be used to: (i) map their respective 
25 genes on a chromosome; and, thus, locate gene regions associated with genetic disease; (it) 

identify an individual from a minute biological sample (tissue typing);- and (Hi) aid in forensic 
identification of a biological sample. Some of these applications are described in the 
subsections, below. 

Chromosome Mapping 

30 Once the sequence (or a portion of the sequence) of a gene has been isolated, this 

sequence can be used to map the location of the gene on a chromosome. This process is called 
chromosome mapping. Accordingly, portions or fragments of the NOVX sequences, SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 45, 47 
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and 49, or fragments or derivatives thereof, can be used to map the location of the NOVX 
genes, respectively, on a chromosome. The mapping of the NOVX sequences to 
chromosomes is an important first step in correlating' these sequences with genes associated 
with disease. 

5 Briefly, NOVX genes can be mapped to chromosomes by preparing PCR primers 

(preferably 1 5-25 bp in length) from the NOVX sequences. Computer analysis of the NOVX, 
sequences can be used to rapidly select primers that, do not span more than one ex on in the 
genomic DNA, thus complicating the amplification process. These primers can then be used 
for PCR screening of somatic cell hybrids containing individual human chromosomes. Only 
1 0 {hose hybrids containing the human gene corresponding to the NOVX sequences will yield an 
amplified fragment. 

• ■ Somatic cell hybrids are prepared by fusing somatic cells from different mammals 
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, they 
gradually lose human chromosomes in random order, but retain the mouse chromosomes. By 

1 5 using media in which mouse cells cannot grow, because they Jack a particular enzyme, but in 
which human cells can, the one human chromosome that contains the gene encoding the 
needed enzyme will be retained. By using various media, panels of hybrid cell lines can be 
established. Each cell line in a panel contains either a single human chromosome or a small 
number of human chromosomes, and a fall set of mouse chromosomes, allowing easy 

20 mapping of individual genes to specific human chromosomes. See, e.g., D'Eustachio, et al., 
1983. Science 220: 919-924. Somatic cell hybrids containing only fragments ofhuman 
chromosomes can also be produced by using human chromosomes with translocations and 
deletions. 

PCR mapping of somatic cell hybrids is a rapid procedure for assigning a particular 
25 sequence to a particular chromosome. Three or more sequences can be assigned per day using 
a single thermal cycler. Using the NOVX sequences to design oligonucleotide primers, sub- 
localization can be achieved with panels of fragments from specific chromosomes. 

Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 
chromosomal spread can further be used to provide a precise chromosomal location in one 
30 step. Chromosome spreads can be made using cells whose division has been blocked in 

metaphase by a chemical like colcemid that disrupts the mitotic spindle. The chromosomes 
can be treated briefly with trypsin, and then stained with Giemsa. A pattern of light and dark 
bands develops on each chromosome, so that the chromosomes can be identified individually. 
The FISH technique can be used with a DNA sequence as short as 500 or 600 bases. 
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However, clones larger than 1,000 bases have a higher likelihood of binding to a unique ■ 
chromosomal location with sufficient signal intensity for simple detection. Preferably 1,000 
bases, and more preferably 2,000 bases, will suffice to getgood results at a reasonable amount 
of time. For a review of this technique, see, Verma, et aL, Human CHROMOSOMES : A 
5 Manual of Basic Techniques (Pergamon Press, New York 1 988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for ' ' 
marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping purposes. Coding sequences are more 

.0 likely to be conserved within gene families, thus increasing the chance of cross hybridizations 
during chromosomal mapping. 

Once a sequence has been mapped to a precise chromosomal 16cation, the physical 
position of the sequence on the chromosome can be correlated with genetic map data. Such 
data are found, e.g., in McKusick, Mendelian INHERITANCE IN MAk, available on-line 

.5 through Johns Hopkins University Welch Medical Library). The relationship between genes 
and disease, mapped to the same chromosomal region, can then be identified through linkage 
analysis (co-inheritance of physically adjacent genes), described in, e.g., Egeland, et ah, 1987. 
Nature, 325: 783-787. 

Moreover, differences in the DNA sequences between individuals affected and 

:0 unaffected with a disease associated with the NOVX gene, can be determined. If a mutation is 
observed in some or all of the affected individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the particular disease. Comparison of 
affected and unaffected individuals generally involves first looking for structural alterations in 
the chromosomes, such as deletions or translocations that are visible from chromosome 

:5 spreads or detectable using PCR based on that DNA sequence. Ultimately, complete 

sequencing of genes from several individuals can be performed to confirm the presence of a 
mutation and to distinguish mutations from polymorphisms. 

Tissue Typing 

The NOVX sequences of the invention can also be used to identify individuals from 
•0 minute biological samples. In this technique, an individual's genomic DNA is digested with 
one or more restriction enzymes, and probed on a Southern blot to yield unique bands for 
identification. The sequences of the invention are useful as additional DNA markers for RFLP 
("restriction fragment length polymorphisms," described in U.S. Patent No. 5,272,057). 
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Furthermore, the sequences of the invention can be used to provide an alternative 
technique that determines the actual base-by-base DNA sequence of selected portions of an . 
individual's genome. Thus, the NOVX sequences described herein can be used to prepare two 
PCR primers from the 5'- and 3 termini of the sequences.' These primers can then be used to 
5 amplify an individual's DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from individuals, prepared in this manner, 
can provide unique individual identifications, as each individual will have a unique set of such 

i 

DNA sequences due to allelic differences. The sequences of the invention can be used to 
obtain such identification sequences from individuals and from tissue. The NOVX sequences 

1 0 of the invention uniquely represent portions of the human genome. Allelic variation occurs to 
some degree in the coding regions of these sequences, and to a greater degree in the noncoding 
' regions. It is estimated that allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Much of the allelic variation is due to single 
nucleotide polymorphisms (SNPs), which include restriction fragment length polymorphisms 

15 (RFLPs). 

Each of the sequences described herein can, to some degree, be used as a standard 
against which DNA from an individual can be compared for identification purposes. Because 
greater numbers of polymorphisms occur in the noncoding regions, fewer sequences are 
necessary to differentiate individuals. The noncoding sequences can comfortably provide 
20 positive individual identification with a panel of perhaps 10 to 1,000 primers that each yield a 
noncoding amplified sequence of 100 bases. If predicted coding sequences, such as those in 
SEQIDNOS:l,3, 5,7, 9, 11, 13, 15, 17, 19,21,23,25, 27,29,31,33, 34,35,37,39,41,43, 
45, 47 and 49 are used, a more appropriate number of primers for positive individual 
identification would be 500-2,000. 

25 Predictive Medicine 

The invention also pertains to the field of predictive medicine in which diagnostic 
assays, prognostic assays, pharmacogenomics, and monitoring clinical trials are used for 
prognostic (predictive) purposes to thereby treat an individual prophylactically. Accordingly, 
one aspect of the invention relates to diagnostic assays for determining NOVX protein and/or 
30 nucleic acid expression as well as NOVX activity, in the context of a biological sample {e.g., 
blood, serum, cells, tissue) to thereby determine whether an individual is afflicted with a 
disease or disorder, or is at risk of developing a disorder, associated with aberrant NOVX 
expression or activity. The disorders include metabolic disorders, diabetes, obesity, infectious 

209 

BNSDOCID: <:WO_ 02055704 A2_l_> 



WO 02/055704 PCT/US02/00554 

disease, anorexia, cancer-associated cachexia, cancer, neurodegenerative disorders, 
Alzheimer's Disease, PdrTdnson's' Disorder, immune disorders, and hematopoietic disorders, 
and the various dyslipidehiias, ntetabolic disturbances associated with obesity, the metabolic 
syndrome X and wasting disorders associated with chronic, diseases and various cancers. The 
5 invention also provides for prognostic (or predictive) assays for determining whether an 
individual is at risk of developing a disorder associated with NOVX protein, nucleic acid 
expression or activity. For exarrfple, mutations in an NOVX gene can be assayed in a: 1 
biological sample. Snch assays can be used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset of a disorder characterized by or 

10 associated with NOVX protein, nucleic acid expression, or biological activity. 

Another aspect of the invention provides methods for determining NOVX protein, 
nucleic acid expression or activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents (e.g. , drugd) for therapeutic or 

1 5 prophylactic treatment of an individual based on the genotype of the individual (e.g., the 

genotype of the individual examined to determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of NOVX in clinical trials. 
20 These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

An exemplary method for detecting the presence or absence of NOVX in a biological 
sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting NOVX protein or nucleic acid (e.g., 

25 mRNA, genomic DNA) that encodes NOVX protein such that the presence of NOVX is 

detected in the biological sample. An agent for detecting NOVX mRNA or genomic DNA is a 
labeled nucleic acid probe capable of hybridizing to NOVX mRNA or genomic DNA. The 
nucleic acid probe can be, for example, a full-length NOVX nucleic acid, such as the nucleic 
acid of SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 

30 41, 43, 45, 47 and 49, or a portion thereof, such as an oligonucleotide of at least 15, 30, 50, 
1 00, 250 or 500 nucleotides in length and sufficient to specifically hybridize under stringent 
conditions to NOVX mRNA or genomic DNA. Other suitable probes for use in the diagnostic 
assays of the invention are described herein. 
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An agent for detecting NOVX protein is an antibody capable of binding to NOVX 
protein, preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g., Fab or F(ab')2) can be 
used. The term "labeled", with regard to the probe or antibody, is intended to encompass 
5 direct labeling of the probe or antibody by coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect labeling of the probe or antibody by 
reactivity with another reagent that is directly labeled, fexamples of indirect labeling include 

detection of a primary antibody using a fluorescently-labeled secondary antibody and 

i 

end-labeling of a DNA probe with biotin such that it can be detected with fluorescently- 
10 labeled streptavidin. The term "biological sample" is intended to include tissues, cells and 

biological fluids isolated from a subject, as well as tissues, cells and fluids present within a 

t 

subject. That is, the detection method of the invention can be used to detect NOVX mRNA, 
protein, or genomic DNA in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques fpr detection of NOVX mRNA include Northern hybridizations and in situ 

15 hybridizations. In vitro techniques for detection of NOVX protein include enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations, and 
immunofluorescence. In vitro techniques for detection of NOVX genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of NOVX protein 
include introducing into a subject a labeled anti-NOVX antibody. For example, the antibody 

20 can be labeled with a radioactive marker whose presence and location in a subject can be 
detected by standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is a 

25 peripheral blood leukocyte sample isolated by conventional means from a subject. 

hi another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting NOVX protein, mRNA, or genomic DNA, such that the presence of 
NOVX protein, mRNA or genomic DNA is detected in the biological sample, and comparing 

30 the presence of NOVX protein, mRNA or genomic DNA in the control sample with the 
presence of NOVX protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of NOVX in a 
biological sample. For example, the kit can comprise: a labeled compound or agent capable of 
detecting NOVX protein or mRNA in a biological sample; means for determining the amount 
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of NOVX in the sample; and means for comparing the amount of NOVX in the sample with a 
standard. The compound or agent can be packaged in a suitable container. The kit can further , 
comprise instructions for using the kit to detect NOVX protein or nucleic acid. 

Prognostic Assays 

5 The diagnostic methods described herein can furthermore be utilized to identify 

subjects having or at risk of developing a disease or disorder associated with aberrant NOVX 

1 

expression or activity. For example, the assays described herein, such as the preceding 
diagnostic assays or the following assays, can be utilized to identify a subject having or at risk 
of developing a disorder associated with NOVX protein, nucleic acid expression or activity. , 

.0 Alternatively, the prognostic assays can be utilized to identify a subject having or at risk for 
developing a disease or disorder. Thus, the invention provides a method for identifying a 
disease or disorder associated with aberrant NOVX expression or activity in which a test 
sample is obtained from a subject and NOVX protein or nucleic acid (e.g., mRNA, genomic 
DNA) is detected, wherein the presence of NOVX protein or nucleic acid is diagnostic for a 

5 subject having or at risk of developing a disease or disorder associated with aberrant NOVX 
expression or activity. As used herein, a ,f test sample" refers to a biological sample obtained 
from a subject of interest. For example, a test sample can be a biological fluid (e.g., serum), 
cell sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether 
0 a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 
associated with aberrant NOVX expression or activity. For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a disorder. 
Thus, the invention provides methods for determining whether a subject can be effectively 
5 treated with an agent for a disorder associated with aberrant NOVX expression or activity in 
which a test sample is obtained and NOVX protein or nucleic acid is detected (e.g., wherein 
the presence of NOVX protein or nucleic acid is diagnostic for a subject that can be 
administered the agent to treat a disorder associated with aberrant NOVX expression or 
activity). 

0 The methods of the invention can also be used to detect genetic lesions in an NOVX 

gene, thereby determining if a subject with the lesioned gene is at risk for a disorder 
characterized by aberrant cell proliferation and/or differentiation. In various embodiments, the 
methods include detecting, in a sample of cells from the subject, the presence or absence of a 
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genetic lesion characterized by at least one of an alteration affecting the integrity of a gene 
"" encoding an NOVX-protein, or the misexpression of the NOVX gene. For example, such 
genetic lesions can be detected by ascertaining the existence of at least one of: (i) a deletion of 
one or more nucleotides from an NOVX gene; (zz) an addition of one or more nucleotides to an 
; 5 NOVX gene; (Hi) a substitution of one or more nucleotides of an NOVX gene, (zv) a 

chromosomal rearrangement of an NOVX gene; (v) an alteration in the level of a messenger 
RNA transcript of an NOVX gene, (vi) aberrant modification of an NOVX gene, such as of the 
methylation pattern of the genomic DNA, (vii) the presence of a non-wild-type splicing pattern 
of a messenger RNA transcript of an NOVX gene, (viii) a non-wild-type level of an NOVX 
3 0 protein, (ix) allelic loss of an NOVX gene, and (x) inappropriate post-translational 

modification of an NOVX protein. As described herein, there are a large number of assay 
techniques known in the art which can be used for detecting lesions in an NOVX gene. A 
preferred biological sample is a peripheral blood leukocyte sample isolated by conventional 
means from a subject. However, any biological sample containing nucleated cells maybe 
15 used, including, for example, buccal mucosal cells. , 

In certain embodiments, detection of the lesion involves the use of a probe/primer in a 
polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), such 
as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, e.g., 
Landegran, et al 9 1988. Science 241: 1077-1080; and Nakazawa, et al 9 1994. Proc. Natl. 

i 

20 Acad. Sci. USA 91 : 360-364), the latter of which can be particularly useful for detecting point 
mutations in the NOVX-gene (see, Abravaya, et ah, 1995. Nucl Acids Res. 23: 675-682). 
This method can include the steps of collecting a sample of cells from a patient, isolating 
nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, contacting the 
nucleic acid sample with one or more primers that specifically hybridize to an NOVX gene 

25 under conditions such that hybridization and amplification of the NOVX gene (if present) 

occurs, and detecting the presence or absence of an amplification product, or detecting the size 
of the amplification product and comparing the length to a control sample. It is anticipated 
that PCR and/or LCR may be desirable to use as a preliminary amplification step in 
conjunction with any of the techniques used for detecting mutations described herein. 

30 Alternative amplification methods include: self sustained sequence replication (see, 

Guatelli, et aL 9 1990. Proc. Natl. Acad. Sci. USA 87: 1874-1878), transcriptional amplification 
system (see, Kwoh, et aL, 1989. Proc. Natl Acad. Sci. USA 86: 1 173-1 177); QP Replicase 
(see, Lizardi, et at, 1988. BioTechnology 6: 1 197), or any other nucleic acid amplification 
method, followed by the detection of the amplified molecules using techniques well known to 
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those of skill in the art. These detection schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in very low numbers. 

In an alternative embodiment, mutations in an NOVX gene from a sample cell can be 
identified by alterations in restriction enzyme cleavage patterns. For example, sample and 
5 control DNA is isolated, amplified (optionally), digested with one or more restriction 

endonucleases, and fragment length sizes are determined by gel electrophoresis and compared. 
Differences in fragment length siizes between sample and control DNA indicates mutations in 
the sample DNA. Moreover, the use of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,493,531) can be used to score for the presence of specific mutations by development or 

10 loss of a ribpzyme cleavage site. 

In other embodiments, genetic mutations in. NOVX can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to high-density arrays containing 
hundreds or thousands of oligonucleotides probes. See, e.g., Cronin, et al., 1996. Human 
Mutation 7: 244-255; Kozal, et al, 1996. Nat. Med. 2: 753-759. For example,' genetic 

1 5 mutations in NOVX can be identified in two dimensional arrays containing light-generated 

DNA probes as described in Cronin, et aL, supra. Briefly, a first hybridization array of probes 
can be used to scan through long stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of sequential overlapping probes. 
This step allows the identification of point mutations. This is followed by a second 

20 hybridization array that allows the characterization of specific mutations by using smaller, 

specialized probe arrays complementary to all variants or imitations detected. Each mutation 
array is composed of parallel probe sets, one complementary to the wild-type gene and the 
other complementary to the mutant gene. 

In yet another embodiment, any of a variety of sequencing reactions known in the art 

25 can be used to directly sequence the NOVX gene and detect mutations by comparing the 
sequence of the sample NOVX with the corresponding wild-type (control) sequence. 
Examples of sequencing reactions include those based on techniques developed by Maxim and 
Gilbert, 1977. Proa Natl. Acad. Sci. USA 74: 560 or Sanger, 1977. Proc. Natl Acad. Sci. USA 
74: 5463. It is also contemplated that any of a variety of automated sequencing procedures 

30 can be utilized when performing the diagnostic assays (see, e.g., Naeve, et aL, 1995. 
Biotechniques 19: 448), including sequencing by mass spectrometry (see, e.g., PCT 
International Publication No. WO 94/16101; Cohen, et aL, 1996. Adv. Chromatography 36: 
127-162; and Griffin, et aL, 1993. Appl. Biochem. Biotechnol. 38: 147-159). 
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Other methods for detecting mutations in the NOVX gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNAvTRNA or 
RNA/DNA heteroduplexes. See\ e.g., Myers, et ah, 1985. Science 230: 1242. In jgeneral, the 
art technique of "mismatch cleavage" starts by providing heteroduplexes of formed by 
5 hybridizing (labeled) RNA or DNA containing the wild-type NOVX sequence with potentially 
mutant RNA or DNA obtained from a tissue sample. The double-stranded duplexes are 
treated with an agent that cleave^ single-stranded regions of the duplex such as which Will 
exist due to basepair mismatches between the control and sample strands. For instance, 
RNA/DNA duplexes can be treated with RNase and DNA/DNA hybrids treated with Sj 

1 0 nuclease to enzymatically digesting the mismatched regions. In other embodiments, either 
DNA/DNA or RNA/DNA duplexes can be treated with hydroxylamine or osmium tetroxide 
and with piperidine in order to digest mismatched regions. After digestion of the mismatched 
regions, the resulting material is tfye;n separated by size on denaturing polyacryl amide gels to 
determine the site of mutation. See, e.g., Cotton, et ah, 1988. Proc. Natl. Acad. Sci. USA 85: 

15 4397; Saleeba, et al, 1992. Methods Enzymol. 217: 286-295. In an embodiment, the control 
DNA or RNA can be labeled for detection. 

In still another embodiment, the mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 

20 NOVX cDNAs obtained from samples of cells. For example, the mutY enzyme of E. coli 

cleaves A at G/A mismatches and the thymidine DNA glycosylate from HeLa cells cleaves T 
at G/T mismatches. See, e.g., Hsu, et al, 1994. Carcinogenesis 15: 1657-1662. According to 
an exemplary embodiment, a probe based on an NOVX sequence, e.g., a wild-type NOVX 
sequence, is hybridized to a cDNA or other DNA product from a test cell(s). The duplex is 

25 treated with a DNA mismatch repair enzyme, and the cleavage products, if any, can be 
detected from electrophoresis protocols or the like. See, e.g., U.S. Patent No. 5,459,039. 

In other embodiments, alterations in electrophoretic mobility will be used to identify 
mutations in NOVX genes. For example, single strand conformation polymorphism (SSCP) 
may be used to detect differences in electrophoretic mobility between mutant and wild type 

30 nucleic acids. See, e.g., Orita, et aL, 1989. Proc, Natl. Acad. Sci. USA: 86: 2766; Cotton, 
1993. Mutat. Res. 285: 125-144; Hayashi, 1992. Genet. Anal Tech. Appl. 9: 73-79. 
Single-stranded DNA fragments of sample and control NOVX nucleic acids will be denatured 
and allowed to renature. The secondary structure of single-stranded nucleic acids varies 
according to sequence, the resulting alteration in electrophoretic mobility enables the detection 
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of even a single base change. The DNA fragments may be labeled or detected with labeled 
probes. The sensitivity of the assay may be enhanced by using RNA (rather than DNA), in • 
which the secondary structure is more sensitive to a change in sequence. In one embodiment, 
the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
5 molecules on the basis of changes in electrophoretic mobility. See, e.g., Keen, et aL, 1991. 
Trends Genet. 7: 5. • 

In yet another embodiment, the movement of mutant oi" wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 
gel electrophoresis (DGGE). See, e.g., Myers, et aL, 1985. Nature 313: 495. When DGGE is 

10 used as the method of analysis, DNA will be modified to insure that it does not completely 

denature, for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich 
' DNA by PCR. In a further embodiment, a temperature gradient is used in place of a 
denaturing gradient to identify differences in the mobility of control and sample DNA. See, 
e.g., Rosenbaum and Reissner, 1987. Biophys. Chem. 265: 12753. 

15 Examples of other techniques for detecting point mutations include, but are not limited 

to, selective oligonucleotide hybridization, selective amplification, or selective primer 
extension. For example, oligonucleotide primers may be prepared in which the known 
mutation is placed centrally and then hybridized to target DNA under conditions that permit 
hybridization only if a perfect match is found. See, e.g., Saiki, et aL, 1986. Nature 324: 163; 

20 Saiki, et aL, 1989. Proc. Natl. Acad. Sci. USA 86: 6230. Such allele specific oligonucleotides 
are hybridized to PCR amplified target DNA or a number of different mutations when the 
oligonucleotides are attached to the hybridizing membrane and hybridized with labeled target 
DNA. 

Alternatively, allele specific amplification technology that depends on selective PCR 
25 amplification may be used in conjunction with the instant invention. Oligonucleotides used as 
primers for specific amplification may cany the mutation of interest in the center of the 
molecule (so that amplification depends on differential hybridization; see, e.g., Gibbs, et aL, 
1989. Nucl. Acids Res. 17: 2437-2448) or at the extreme 3'-terminus of one primer where, 
under appropriate conditions, mismatch can prevent, or reduce polymerase extension (see, e.g., 
30 Prossner, 1993. Tibtech. 1 1 : 238). In addition it may be desirable to introduce a novel 

restriction site in the region of the mutation to create cleavage-based detection. See, e.g., 
Gasparini, et aL, 1992. Mol. Cell Probes 6: 1. It is anticipated that in certain embodiments 
amplification may also be performed using Taq ligase for amplification. See, e.g., Barany, 
1991. Proc. Natl. Acad. Sci. USA 88: 189. In such cases, ligation will occur only if there is a 
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perfect match at the 3 '-terminus of the 5* sequence, making it possible fo detect the presence of 
a known mutation at a specific siteby looking for the presence or absence of amplification. 

The methods described herein may be performed, for example, by utilizing 
pre-packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
5 described herein, which may be conveniently used, e.g., in clinical settings to diagnose 
patients exhibiting symptoms or family history of a disease or illness involving an NOVX 
gene. , • ■ . 

Furthermore, any cell type or tissue, preferably peripheral blood leukocytes, in which 
NOVX is expressed may be utilize^ in the prognostic assays described herein. However, any 
1 0 biological sample containing nucleated cells may be used, including, for example, buccal 
mucosal cells. 

Pharmacogenomics 

Agents, or modulators that 'have a stimulatory or inhibitory effect on NOVX activity 
(e.g., NOVX gene expression), as identified by a screening assay described herein can be 

1 5 administered to individuals to treat (prophylactically or therapeutically) disorders (The 

disorders include metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cachexia, cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's 
Disorder, immune disorders, and hematopoietic disorders, and the various dyslipidemias, 
metabolic disturbances associated with obesity, the metabolic syndrome X and wasting 

20 disorders associated with chronic diseases and various cancers.) In conjunction with such 
treatment, the pharmacogenomics (i.e., the study of the relationship between an individual's 
genotype and that individual's response to a foreign compound or drug) of the individual may 
be considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 

25 pharmacologically active drug. Thus, the pharmacogenomics of the individual permits the 

selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such pharmacogenomics can further be used to 
determine appropriate dosages and therapeutic regimens. Accordingly, the activity of NOVX 
protein, expression of NOVX nucleic acid, or mutation content of NOVX genes in an 

30 individual can be determined to thereby select appropriate agent(s) for therapeutic or 
prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. See 
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e.g., Eichelbaum, 1996. Clin. Exp. Pharmacol Physiol, 23: 983-985; Under, 1997. Clin. 
"" Chem., 43: 254-266. In general, two types of pharmacogenetic conditions can be 

differentiated. Genetic conditions transmitted as a sirigle factor altering the way drugs act on 
the body (altered drug action) or genetic conditions transmitted as single factors altering the 
5 way the body acts on drugs (altered drug metabolism). These pharmaco genetic conditions can 
occur either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the main 
clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials,. 
sulfonamides, analgesics, nitroflirans) and consumption of fava beans. 
10 ' , As an illustrative embodiment, the activity of drug metabolizing enzymes is a major ' 
determinant of both the intensity and duration of drug action. The discovery of genetic 
1 polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response and 
1 5 serious toxicity after taking the standard and safe dose of a drug. These polymorphisms are 
expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among different populations. For 
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have been 
identified in PM, which all lead to the absence of functional CYP2D6. Poor metabolizers of 
20 CYP2D6 and CYP2C19 quite frequently experience exaggerated drug response and side 

effects when they receive standard doses. If a metabolite is the active therapeutic moiety, PM 
show no therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 
25 ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the activity of NOVX protein, expression of NOVX nucleic acid, or mutation 
content of NOVX genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles encoding 
30 drug-metabolizing enzymes to the identification of an individual's drug responsiveness 

phenotype. This knowledge, when applied to dosing or drug selection, can avoid adverse 
reactions or therapeutic failure and thus enhance therapeutic or prophylactic efficiency when 
treating a subject with an NOVX modulator, such as a modulator identified by one of the 
exemplary screening assays described herein. 
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Monitoring of Effects During Clinical Trials 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression or 

activity ,of NOVX (e.g., the ability to modulate aberrant cell proliferation and/or 

differentiation) can be applied not only in basic drug screening, but also in clinical trials. For 

5 example, the effectiveness of an agent determined by a screening assay as described herein to 

increase NOVX gene expression, protein levels, or upregulate NOVX activity, can be 

monitored in clinical trails of subjects exhibiting decreased N6VX gene expression, protein 

levels, or downregulated NOVX activity. Alternatively, the effectiveness of an agent 

i 

determined by a screening assay to decrease NOVX gene expression, protein levels, or 
10 ddwnregulate NOVX activity, canbe monitored in clinical trails of subjects exhibiting 

increased NOVX gene expression, protein levels, or upregulated NOVX activity. In such 

i 

clinical trials, the expression or activity of NOVX and, preferably, other genes that have been 
implicated in, for example, a cellular proliferation or immune disorder can be used as a "read 
out" or markers of the immune responsiveness of a particular cell. 

1 5 By way of example, and not of limitation, genes, including NOVX, that are modulated 

in cells by treatment with an agent (e.g., compound, drug or small molecule) that modulates 
NOVX activity (e.g., identified in a screening assay as described herein) can be identified. 
Thus, to study the effect of agents on cellular proliferation disorders, for example, in a clinical 
trial, cells can be isolated and RNA prepared and analyzed for the levels of expression of 

20 NOVX and other genes implicated in the disorder. The levels of gene expression (i.e., a gene 
expression pattern) can be quantified by Northern blot analysis or RT-PCR, as described 
herein, or alternatively by measuring the amount of protein produced, by one of the methods 
as described herein, or by measuring the levels of activity of NOVX or other genes. In this 
manner, the gene expression pattern can serve as a marker, indicative of the physiological 

25 response of the cells to the agent. Accordingly, this response state may be determined before, 
and at various points during, treatment of the individual with the agent. 

In one embodiment, the invention provides a method for monitoring the effectiveness 
of treatment of a subject with an agent (e.g., an agonist, antagonist, protein, peptide, 
peptidomimetic, nucleic acid, small molecule, or other drug candidate identified by the 

30 screening assays described herein) comprising the steps of (0 obtaining a pre-administration 

sample from a subject prior to administration of the agent; (ii) detecting the level of expression 

of an NOVX protein, mRNA, or genomic DNA in the preadministration sample; (Hi) obtaining 

one or more post- administration samples from the subject; (iv) detecting the level of 

expression or activity of the NOVX protein, mRNA, or genomic DNA in the 
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post-administration samples; (v) comparing the level of expression or activity of the NOVX 
protein, mRNA, or gentfmic $NA"in the pre-admihistration sample' with the 'NOVX protein, 
mRNA, or genomic DNA in the'post administration sample or samples; and (vz) altering the 
administration of the agent to the subject accordingly. For example, increased administration 
5 of the agent may be desirable to increase the expression or activity of NOVX to higher levels 
than detected, i.e., to increase the effectiveness of the agent. Alternatively, decreased 
administration of the agent may ]?e desirable to decrease expression or .activity of NOVX to 
lower levels than detected, i.e., to decrease the effectiveness of the agent. 

Methods of Treatment, 

1 0 The.invention provides for both prophylactic and therapeutic methods of treating a 

subject at risk of (or susceptible to) a disorder or having a disorder associated with aberrant 
NOVX expression or activity. The disorders include cardiomyopathy, atherosclerosis, 
hypertension, congenital heart defects, aortic stenosis, atrial septal defect (ASD), 
atrioventricular (A-V) canal defect, ductus arteriosus, pulmonary stenosis, subaortic stenosis, 

1 5 ventricular septal defect (VSD), valve diseases, tuberous sclerosis, scleroderma, obesity, 
transplantation, adreno leukodystrophy, congenital adrenal hyperplasia, prostate cancer, 
neoplasm; adenocarcinoma, lymphoma, uterus cancer, fertility, hemophilia, hypercoagulation, 
idiopathic thrombocytopenic purpura, immunodeficiencies, graft versus host disease, AIDS, 
bronchial asthma, Crohn's disease; multiple sclerosis, treatment of Albright Hereditary 

20 Osteodystrophy, and other diseases, disorders and conditions of the like. 

These methods of treatment will be discussed more' fully, below. 

Disease and Disorders 

Diseases and disorders that are characterized by increased (relative to a subject not 
suffering from the disease or disorder) levels or biological activity may be treated with 

25 Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that antagonize 

activity may be administered in a therapeutic or prophylactic manner. Therapeutics that may 
be utilized include, but are not limited to: (/) an aforementioned peptide, or analogs, 
derivatives, fragments or homologs thereof; (ii) antibodies to an aforementioned peptide; (Hi) 
nucleic acids encoding an aforementioned peptide; (zv) administration of antisense nucleic acid 

30 and nucleic acids that are "dysfunctional" (i.e., due to a heterologous insertion within the 
coding sequences of coding sequences to an aforementioned peptide) that are utilized to 
"knockout" endogenous function of an aforementioned peptide by homologous recombination 
(see, e.g., Capecchi, 1989. Science 244: 1288-1292); or (v) modulators ( i.e., inhibitors, 
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agonists and antagonists, including additional peptide mimetic of the invention or antibodies 
specific to a peptide of the invention) that alter the interaction between an aforementioned 
peptide and its binding partner. ' 

Diseases and disorders that are characterised by decreased (relative to a subject not 
5 suffering from the disease or disorder) levels or biological activity may be treated with 

Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that upregulate activity 
may be administered in a therapeutic or prophylactic manner. Therapeutics that may be ; 
utilized include, but are not limited to, an aforementioned peptide, or analogs, derivatives, 
fragments or homologs thereof; or an agonist that increases bioavailability. 

1 0 Increased or decreased levels can be readily detected by quantifying peptide and/or 

RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and assaying it in vitro for 
RNA or peptide levels, structure and/or activity ,of the expressed peptides (or mRNAs of an 
aforementioned peptide). Methods that are well-known within the art include, but are not 
limited to, immunoassays (e.g., by Western blot analysis, immunoprecipitatiorl followed by 

15 sodium dodecyl sulfate (SDS) polyacryl amide gel electrophoresis \ immunocytochemistry, etc.) 
and/or hybridization assays to detect expression of mRNAs (e.g., Northern assays, dot blots, in 
situ hybridization, and the like). 

Prophylactic Methods 

In one aspect, the invention provides a method for preventing, in a subject, a disease or 
20 condition associated with an aberrant NOVX expression or activity, by administering to the 
subject an agent that modulates NOVX expression or at least one NOVX activity. Subjects at 
risk for a disease that is caused or contributed to by aberrant NOVX expression or activity can 
be identified by, for example, any or a combination of diagnostic or prognostic assays as 
described herein. Administration of a prophylactic agent can occur prior to the manifestation 
25 of symptoms characteristic of the NOVX aberrancy, such that a disease or disorder is 

prevented or, alternatively, delayed in its progression. Depending upon the type of NOVX 
aberrancy, for example, an NOVX agonist or NOVX antagonist agent can be used for treating 
the subject. The appropriate agent can be determined based on screening assays described 
herein. The prophylactic methods of the invention are further discussed in the following 
30 subsections. 

Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating NOVX expression 
or activity for therapeutic purposes. The modulatory method of the invention involves 
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contacting a cell with an agent that modulates one or more of the activities of NOVX protein 
"' activity associated with the cell. An agent that modulates NOVX protein activity can be an , 
agent as described herein, such as a nucleic acid or a protein, a 'naturally-occurring cognate 
ligand of an NOVX protein, a peptide, an NOVX peptidomimetic, or other small molecule. In 
5 one embodiment, the agent stimulates one or more NOVX protein activity. Examples of such 
stimulatory agents include active NOVX protein and a nucleic acid molecule encoding NOVX 
that has been introduced into the cell. In another embodiment, .the agent inhibits one or more 
NOVX protein activity. Examples of such inhibitory agents include antisense NOVX nucleic 
acid molecules and anti-NOVX antibodies. These modulatory methods can be performed in 

10 vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering 
the agent to a subject). As such, the invention provides methods of treating an individual 
• afflicted with a disease or disorder characterized by aberrant expression or activity of an 
NOVX protein or nucleic acid molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a screening assay described herein), or 

15 combination of agents that modulates (e.g., up-regulates or down-regulates) NOVX expression 
or activity. In another embodiment, the method involves administering an NOVX protein or 
nucleic acid molecule as therapy to compensate for reduced or aberrant NOVX expression or 
activity. 

Stimulation of NOVX activity is desirable in situations in which NOVX is abnormally 
20 downregulated and/or in which increased NOVX activity is likely to have a beneficial effect. 
One example of such a situation is where a subject has a disorder characterized by aberrant 
cell proliferation and/or differentiation (e.g., cancer or immune associated disorders). Another 
example of such a situation is where the subject has a gestational disease (e.g., preclampsia). 

Determination of the Biological Effect of the Therapeutic 

25 In various embodiments of the invention, suitable in vitro or in vivo assays are 

perfonned to determine the effect of a specific Therapeutic and whether its administration is 
indicated for treatment of the affected tissue. 

In various specific embodiments, in vitro assays may be performed with representative 
cells of the type(s) involved in the patient's disorder, to determine if a given Therapeutic exerts 

30 the desired effect upon the cell type(s). Compounds for use in therapy may be tested in 
suitable animal model systems including, but not limited to rats, mice, chicken, cows, 
monkeys, rabbits, and the like, prior to testing in human subjects. Similarly, for in vivo 
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testing, any of the animal model system known in the art may be used prior to administration 
to human subjects. 

Prophylactic and Therapeutic Uses of the Compositions of the Invention 

The NOVX nucleic acids and proteins of the invention are useful in potential 
5 prophylactic and therapeutic applications implicated in a variety of disorders including, but not 
limited to: metabolic disorders, diabetes, obesity, infectious disease, anorexia, cancer- 
associated cancer, neurodegenerative disorders, Alzheimer's Disease, Parkinson's Disorder, 
immune disorders, hematopoietic disorders, and the various dyslipidemias, metabolic 
disturbances associated with obesity, the metabolic syndrome X and wasting disorders 1 
10 associated with chronic diseases and various cancers. 

, , As an example, a cDNA encoding the NOVX protein of the invention may be useful in 
gene therapy, and the protein may be useful when administered to a subject in need thereof. 
By way of non-limiting example, the compositions of the invention will have efficacy for 
treatment of patients suffering from: metabolic disorders, diabetes, obesity, infectious disease, 
15 anorexia, cancer- associated cachexia, cancer, neurodegenerative disorders, Alzheimer's 

Disease, Parkinson's Disorder, immune disorders, hematopoietic disorders, and the various 
dyslipidemias. 

Both the novel nucleic acid encoding the NOVX protein, and the NOVX protein of the 
invention, or fragments thereof, may also be useful in diagnostic applications, wherein the 
20 presence or amount of the nucleic acid or the protein are to be assessed. A further use could 
be as an anti -bacterial molecule (/.£?., some peptides have been found to possess an ti -bacterial 
properties). These materials are further useful in the generation of antibodies, which 
immunospecifically-bind to the novel substances of the invention for use in therapeutic or 
diagnostic methods. 

25 The invention will be further described in the following examples, which do not limit 

the scope of the invention described in the claims. 

Examples 

Example 1: Identification of NOVX Nucleic Acids 

TblastN using CuraGen Corporation's sequence file for polypeptides or homologs was 
30 run against the Genomic Daily Files made available by GenBank or from files downloaded 
from the individual sequencing centers. Exons were predicted by homology and the 
intron/exon boundaries were determined using standard genetic rules. Exons were further 
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selected and refined by means of similarity determination using multiple BLAST (for , 
example, tBlastN, BlastX, and'Bla'fctN) searches, and, in some instances, GerieScan and GraiL 
Expressed sequences from both public and proprietary databases were also added when 
available to further define and complete the gene sequence. .The DNA sequence was then 
manually corrected for apparent inconsistencies thereby obtaining the sequences encoding the 
full-length protein. 

The novel NOVX target sequences- identified in the present invention were subjected to 
the exon linking process to confirm the sequence. PCR primers were designed by starting at 
the most upstream sequence available, for the forward primer, and at the most downstream 
sequence available for the reverse primer. PCR primer sequences were used for obtaining 
different clones. In each case, the sequence was examined, walking inward from the 
respective termini toward the coding sequence, until a suitable sequence that is either unique 
or highly selective was encountered, or, in the case of the reverse primer, until the stop codon 
was reached. Such primers were designed based. on in silico predictions for the full length 
cDNA, part (one or more exons) of the DNA or protein sequence of the target sequence, or by 
translated homology of the predicted exons to closely related human sequences from other 
species. These primers were then employed in PCR amplification based on the following pool 
of human cDNAs: adrenal gland, bone marrow;, brain - amygdala, brain - cerebellum, brain - 
hippocampus, brain - substantia nigra, brain - thalamus, brain -whole, fetal brain, fetal kidney, 
fetal liver, fetal lung, heart, kidney, lymphoma - Raji, mammary gland, pancreas, pituitary 
gland, placenta, prostate, salivary gland, skeletal muscle, small intestine, spinal cord, spleen, 
stomach, testis, thyroid, trachea, uterus. Usually the resulting amplicons were gel purified, 
cloned and sequenced to high redundancy. The PCR product derived from exon linking was 
cloned into the pCR2.1 vector from Invitrogen. The resulting bacterial clone has an insert 
covering the entire open reading frame cloned into the pCR2.1 vector. The resulting sequences 
from all clones were assembled with themselves, with other fragments in CuraGen 
Corporation's database and with public ESTs. Fragments and ESTs were included as 
components for an assembly when the extent of their identity with another component of the 
assembly was at least 95% over 50 bp. In addition, sequence traces were evaluated manually 
and edited for corrections if appropriate. These procedures provide the sequence reported 
herein. 

Physical clone: Exons were predicted by homology and the intron/exon boundaries 
were determined using standard genetic rules. Exons were further selected and refined by 
means of similarity determination using multiple BLAST (for example, tBlastN, BlastX, and 
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BlastN) searches, and, in some instances, GeneScan and Grail. Expressed sequences from both 
public and proprietary databases were also added when available to further define and 
complete the gene sequence. The DNA sequence was then manually corrected for apparent 
inconsistencies thereby obtaining the sequences encoding the full-length protein. 

5 Example 2: Identification of Single Nucleotide Polymorphisms in NOVX nucleic acid 
sequences 

Variant sequences are also included in this application. A variant sequence can include 
a single nucleotide polymorphism (SNP). A SNP can, in some instances, be refeired to as a 
''cSNP" to denote that the nucleotide sequence containing the SNP originates as a cDNA. A 1 

1 0 SNP can arise in several ways. For example, a SNP may be due to a substitution of one 

.nucleotide for another at the polymorphic site. Such a substitution can be either a transition or 
a transversion. A SNP can also arise from a deletion of a nucleotide or an insertion of a 
nucleotide, relative to a reference allele. In this case, the polymorphic site is a site at which 
one allele bears a gap with respect to a particular nucleotide in another allele. SNPs occurring 

1 5 within genes may result in an alteration of the amino acid encoded by the gene at the position 
of the SNP. Intragenic SNPs may also be silent, when a codon including a SNP encodes the 
same amino acid as a result of the redundancy of the genetic code. SNPs occurring outside the 
region of a gene, or in an intron within a gene, do not result in changes in any amino acid 
sequence of a protein but may result in altered regulation of the expression pattern. Examples 

20 include alteration in temporal expression, physiological response regulation, cell type 
expression regulation, intensity of expression, and stability of transcribed message. 

SeqCalling assemblies produced by the ex on linking process were selected and 
extended using the following criteria. Genomic clones having regions with 98% identity to all 
or part of the initial or extended sequence were identified by BLASTN searches using the 

25 relevant sequence to query human genomic databases. The genomic clones that resulted were 
selected for further analysis because this identity indicates that these clones contain the 
genomic locus for these SeqCalling assemblies. These sequences were analyzed for putative 
coding regions as well as for similarity to the known DNA and protein sequences. Programs 
used for these analyses include Grail, Genscan, BLAST, HMMER, FASTA, Hybrid and other 

30 relevant programs. 

Some additional genomic regions may have also been identified because selected 
SeqCalling assemblies map to those regions. Such SeqCalling sequences may have 
overlapped with regions defined by homology or exon prediction. They may also be included 
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because the location of the fragment was in the vicinity of genomic regions identified by 
similarity or exon prediction that had been included in the original predicted sequence. . The 
sequence so identified wa's manually assembled and theri may have been extended' using one 
or more additional sequences taken from CuraGen Corporation's human SeqCalling database. 
SeqCalling fragments suitable for inclusion were identified by the CuraTools™ program 
SeqExtend or by identifying SeqCalling fragments mapping to the appropriate regions of the 
genomic clones analyzed. , ■ ' 1 

The regions defined by the procedures described above were then manually integrated 
and corrected for apparent inconsistencies that may have arisen, for example, from miscalled 

bases in the original fragments or from discrepancies between predicted exon junctions, EST 

» 

locations and regions of sequence similarity, to derive the final sequence disclosed hereiti. 
When necessary, the process to identify and analyze SeqCalling assemblies and genomic 
clones was reiterated to derive the full length sequence (Alderbom et al., Determination of 
Single Nucleotide Polymorphisms by Real-time Pyrophosphate DNA Sequencing. Genome 
Research. 10 (8) 1249-3265, 2000). 

Example 3. Quantitative expression analysis of clones in various cells and tissues 

The quantitative expression of various clones was assessed using microtiter plates 
containing RNA samples from a variety of normal and pathology-derived cells, cell lines and 
tissues using real time quantitative PCR (RTQ PCR). RTQ PCR was performed on an Applied 
Biosystems ABI PRISM® 7700 or an ABI PRISM® 7900 HT Sequence Detection System. 
Various collections of samples are assembled on the plates, and referred to as Panel 1 
(containing normal tissues and cancer cell lines), Panel 2 (containing samples derived from 
tissues from normal and cancer sources), Panel 3 (containing cancer cell lines), Panel 4 
(containing cells and cell lines from normal tissues and cells related to inflammatory 
conditions), Panel 5D/5I (containing human tissues and cell lines with an emphasis on 
metabolic diseases), AI_comprehensive_panel (containing normal tissue and samples from 
autoimmune diseases), Panel CNSD.01 (containing central nervous system samples from 
normal and diseased brains) and CNS_neurodegeneration_panel (containing samples from 
normal and Alzheimer's diseased brains). 

RNA integrity from all samples is controlled for quality by visual assessment of 
agarose gel electropherograms using 28S and 1 8S ribosomal RNA staining intensity ratio as a 
guide (2:1 to 2.5:1 28s:18s) and the absence of low molecular weight RNAs that would be 
indicative of degradation products. Samples are controlled against genomic DNA 
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contamination by RTQ PCR reactions run in the absence of reverse transcriptase using probe 
and primer sets designed to amplify across the span of a single exon. 

First, the RNA samples were normalized to reference nucleic acid^ such as 

i 

constitutively expressed genes (for example, P-actin and GAPDH). Normalized RNA (5 ul) 
5 was converted to cDNA and analyzed by RTQ-PCR using One Step RT-PCR Master Mix 

Reagents (Applied Biosystems; Catalog No. 4309169) and 'gene-specific primers according to 
the manufacturer's instructions. 1 

In other cases, non-normalized RNA samples were converted to single strand cDNA 
(sscDNA) using Superscript II (Invitrogen Corporation; Catalog No. 18064-147) and random 
10 hexamers according to the manufacturer's instructions. Reactions containing up to 10 ug of 
total RNA were performed in a volume of 20 u.1 and incubated for 60 minutes at 42°C. This 
reaction can be scaled up to 50 ug of total RNA in a final volume o5f 1 00 sscDNA samples 
are then normalized to reference nucleic acids as described previously, using IX TaqMan® 
Universal Master mix (Applied Biosystems; catalog No. 4324020), following'the 
15 manufacturer's instructions. 

Probes and primers were designed for each assay according to Applied Biosystems 
Primer Express Software package (version I for Apple Computer's Macintosh Power PC) or a 
similar algorithm using the target sequence as input. Default settings were used for reaction 
conditions and the following parameters were set before selecting primers: primer 
20 concentration = 250 nM, primer melting temperature (Tm) range = 58°-60 D C, primer optimal 
Tm = 59°C, maximum primer difference = 2°C, probe does not have 5'G, probe Tm must be 
10°C greater than primer Tm, amplicon size 75bp to lOObp. The probes and primers selected 
(see below) were synthesized by Synthegen (Houston, TX, USA). Probes were double purified 
by HPLC to remove uncoupled dye and evaluated by mass spectroscopy to verify coupling of 
25 reporter and quencher dyes to the 5* and 3' ends of the probe, respectively. Their final 
concentrations were: forward and reverse primers, 900nM each, and probe, 200nM. 

PCR conditions: When working with RNA samples, normalized RNA from each tissue 
and each cell line was spotted in each well of either a 96 well or a 384-well PCR plate 
(Applied Biosystems). PCR cocktails included either a single gene specific probe and primers 
30 set, or two multiplexed probe and primers sets (a set specific for the target clone and another 
gene-specific set multiplexed with the target probe). PCR reactions were set up using 
TaqMan® One-Step RT-PCR Master Mix (Applied Biosystems, Catalog No. 4313803) 
following manufacturer's instructions. Reverse transcription was performed at 48°C for 30 
minutes followed by amplification/PCR cycles as follows: 95°C 10 min, then 40 cycles of 
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95°C for 1 5 seconds, 60°C for 1 minute. Results were recorded as CT values (cycle at which a 
■•< given sample crosses a threshold level of fluorescence) using a log scale, with the difference ip 
RNA concentration between a given sample and the sample with the lowest CT value being 
represented as 2 to the power of delta CT. The percent relative expression is then obtained by 
5 taking the reciprocal of this RNA difference and multiplying by 100. 

When working with sscDNA samples, nonnalized sscDNA was used as described 
previously for RNA samples. PCR reactions containing one or,two sets of probe and primers 
were set up as described previously, using IX TaqMan® Universal Master mix (Applied 
Biosystems; catalog No. 4324020), following the manufacturer's instructions. PCR 
1 0 amplification was performed as follows: 95°C 1 0 min, then 40 cycles of 95°C for 1 5 second^, 
60°C for 1 minute. Results were analyzed and processed as described previously. 
, Panels 1, LI, 1.2, and 1.3D 

The plates for Panels 1, 1.1, 1.2 and 1.3D include 2 control wells (genomic DNA 
control and chemistry control) and 94 wells containing cDNA from various samples. The 
1 5 samples in these panels are broken into 2 classes: samples derived from cultured cell lines and 
samples derived from primary normal tissues. The cell lines are derived from cancers of the 
following types: lung cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS 
cancer, squamous cell carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and 
pancreatic cancer. Cell lines used in these panels are widely available through the American 
20 Type Culture Collection (ATCC), a repository for cultured cell lines, and were cultured using 
the conditions recommended by the ATCC. The normal tissues found on these panels are 
comprised of samples derived from all major organ systems from single adult individuals or 
fetuses. These samples are derived from the following organs: adult, skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult 
25 lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 
colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 

In the results for Panels 1, 1.1, 1.2 and 1.3D, the following abbreviations are used: 
ca. = carcinoma, 
30 * = established from metastasis, 

met = metastasis, 
s cell var - small cell variant, 
non-s = non-sm = non-small, 
squam = squamous, 
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pi. eff - pi effusion = pleural effusion, 
glio - glioma, 

astro - astrocytoma, and * '. 

i 

neuro - neuroblastoma. 
5 General screening jpanel_vl. 4 

The plates for Panel 1.4 include 2 control wells (genomic DNA control and chemistry 

control) and 94 wells containing cDNA from various samples. 'The samples in Panel 1 .4 are 

broken into 2 classes: samples derived from cultured cell lines and samples derived from 

primary nomial tissues. The cell 1 lines are derived from cancers of the following types: lung 
i i 
10 cancer, breast cancer, melanoma, colon cancer, prostate cancer, CNS cancer, squamous cell 

carcinoma, ovarian cancer, liver cancer, renal cancer, gastric cancer and pancreatic cancer. 

1 Cell lines used in Panel 1.4 are widely available through the American Type Culture 

Collection (ATCC), a repository for cultured cell lines, and were cultured using the conditions 

recommended by the ATCC. The normal tissues found on Panel 1.4 are comprised of pools of 

15 samples derived from all major organ systems from 2 to 5 different adult individuals or 
fetuses. These samples are derived from the following organs: adult skeletal muscle, fetal 
skeletal muscle, adult heart, fetal heart, adult kidney, fetal kidney, adult liver, fetal liver, adult 
lung, fetal lung, various regions of the brain, the spleen, bone marrow, lymph node, pancreas, 
salivary gland, pituitary gland, adrenal gland, spinal cord, thymus, stomach, small intestine, 

20 colon, bladder, trachea, breast, ovary, uterus, placenta, prostate, testis and adipose. 
Abbreviations are as described for Panels 1, 1.1, 1.2, and 1.3D, 
Panels 2D and 2,2 

The plates for Panels 2D and 2.2 generally include 2 control wells and 94 test samples 
composed of RNA or cDNA isolated from human tissue procured by surgeons working in 

25 close cooperation with the National Cancer Institute's Cooperative Human Tissue Network 
(CHTN) or the National Disease Research Initiative (NDRI). The tissues are derived from 
human malignancies and in cases where indicated many malignant tissues have "matched 
margins" obtained from noncancerous tissue just adjacent to the tumor. These are termed 
normal adjacent tissues and are denoted "NAT" in the results below. The tumor tissue and the 

30 "matched margins" are evaluated by two independent pathologists (the surgical pathologists 
and again by a pathologist at NDRI or CHTN). This analysis provides a gross 
histopathological assessment of tumor differentiation grade. Moreover, most samples include 
the original surgical pathology report that provides information regarding the clinical stage of 
the patient. These matched margins are taken from the tissue surrounding (i.e. immediately 
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proximal) tothe zone of surgery (designated "NAT", for normal adjacent tissue, in Table RR). 
In addition, RNA and c£>NA samples were obtained from various human tissues derived from 
autopsies performed on elderly people or sudden death victims (accidents; etc.). These tissues 
were ascertained to be free of disease and were purchased from various commercial sources 
5 such as Clontech (Palo Alto, CA), Research Genetics, and Invitrogen. 
Panel 3D 

The plates of Panel 3D are comprised of 94 cDNA samples and two control safnples. 
Specifically, 92 of these samples are derived from cultured human cancer cell lines, 2 samples 
of human primary cerebellar tissue, and 2 controls. The human cell lines are generally obtained 

10 from ATCC (American Type Culture Collection), NCI or the German tumor cell bank and fall 
into the following tissue groups: Squamous cell carcinoma of the tongue, breast cancer, ' 
prostate cancer, melanoma, epidermoid carcinoma, sarcomas, bladder carcinomas, pancreatic 
cancers, kidney cancers, leukemias/lymphomas, ovarian/uterine/cervical, gastric, colon, lung 
and CNS cancer cell lines. In addition, there are two independent samples of Cerebellum. 

1 5 These cells are all cultured under standard recommended conditi6ns and RNA extracted using 
the standard procedures. The cell lines in panel 3D and 1.3D are of the most common cell lines 
used in the scientific literature. 
Panels 4D ? 4R, and 4.1D 

Panel 4 includes samples on a 96 well plate (2 control wells, 94 test samples) 

20 composed of RNA (Panel 4R) or cDNA (Panels 4D/4.1D) isolated from various human cell 
lines or tissues related to inflammatory conditions. Total RNA from control normal tissues 
such as colon and lung (Stratagene, La Jolla, CA) and thymus and kidney (Clontech) was 
employed. Total RNA from liver tissue from cirrhosis patients and kidney from lupus patients 
was obtained from BioChain (Biochain Institute, Inc., Hayward, CA). Intestinal tissue for 

25 RNA preparation from patients diagnosed as having Crohn's disease and ulcerative colitis was 
obtained from the National Disease Research Interchange (NDRJ) (Philadelphia, PA). 

Astrocytes, lung fibroblasts, dermal fibroblasts, coronary artery smooth muscle cells, 
small airway epithelium, bronchial epithelium, microvascular dermal endothelial cells, 
microvascular lung endothelial cells, human pulmonary aortic endothelial cells, human 

30 umbilical vein endothelial cells were all purchased from Clonetics (Walkersville, MD) and 
grown in the media supplied for these cell types by Clonetics. These primary cell types were 
activated with various cytokines or combinations of cytokines for 6 and/or 12-14 hours, as 
indicated. The following cytokines were used; IL-T beta at approximately l-5ng/ml, TNF 
alpha at approximately 5-10ng/ml, IFN gamma at approximately 20-50ng/ml, IL-4 at 
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approximately 5-10ng/ml,.IL-9 at approximately 5-10ng/ml, IL-13 at approximately 5- 
lOng/ml. Endothelial cells were sometimes starved for various times by culture in the basal 
media from Clonetics with 0A% serum. 

Mononuclear cells were prepared from blood of employees at CuraGen Corporation, 
5 using Ficoll. LAK cells were prepared from these cells by culture in DMEM 5% FCS 

(Hyclone), 1 OO^M non essential amino acids (Gibco/Life Technologies, Rockville, MD) } 
ImM sodium pyruvate (Gibco), imercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes ; 
(Gibco) and Interleukin 2 for 4-6 days. Cells were then either activated with 10-20ng/ml PMA 
and l-2ng/ml' ionomycip, IL-12 at.5-10ng/ml, IFN gamma at 20-50ng/ml and IL-18 at 5- 
10 1 Ong/ml for 6 hours. In some cases, mononuclear cells were cultured for 4-5 days in DMEM 
5% FCS (Hyclone), IOOjiM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM Hepes (dibco) with PHA 
(phytohemagglutinin) or PWM (pokeweed mitogen) at approximately 5juig/ml. Samples were 
taken at 24, 48 and 72 hours for RNA preparation. MLR (mixed lymphocyte reaction) samples 
1 5 were obtained by taking blood from two donors, isolating the mononuclear cells using Ficoll 
and mixing the isolated mononuclear cells 1:1 at a final concentration of approximately 
2xl0 6 cells/ml in DMEM 5% FCS (Hyclone), lOOpM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol (5.5x 10" 5 M) (Gibco), and lOmM Hepes (Gibco). 
The MLR was cultured and samples taken at various time points ranging from 1-7 days for 
20 RNA preparation. 

Monocytes were isolated from mononuclear cells using CD14 Miltenyi Beads, +ve VS 
selection columns and a Vario Magnet according to the manufacturer's instructions. 
Monocytes were differentiated into dendritic cells by culture in DMEM 5% fetal calf serum 
(FCS) (Hyclone, Logan, UT), 100/iM non essential amino acids (Gibco), ImM sodium 
25 pyruvate (Gibco), mercaptoethanol 5.5xlO" 5 M (Gibco), and lOmM Hepes (Gibco), 50ng/ml 
GMCSF and 5ng/ml IL-4 for 5-7 days. Macrophages were prepared by culture of monocytes 
for 5-7 days in DMEM 5% FCS (Hyclone), ] 00/^M non essential amino acids (Gibco), 3mM 
sodium pyruvate (Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), lOmM Hepes (Gibco) and 
10% AB Human Serum or MCSF at approximately 50ng/ml. Monocytes, macrophages and 
30 dendritic cells were stimulated for 6 and 12-1 4 hours with lipopolysaccharide (LPS) at 
lOOng/ml. Dendritic cells were also stimulated with anti-CD40 monoclonal antibody 
(Pharmingen) at 1 Ojag/ml for 6 and 12-14 hours. 

CD4 lymphocytes, CD 8 lymphocytes and NX cells were also isolated from 
mononuclear cells using CD4, CD8 and CD56 Miltenyi beads, positive VS selection columns 
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and a Vario Magnet according to the manufacturer's instructions. CD45RA and CD45RO CD4 
" lymphocytes were isolated by depleting mononuclear cells of CD8, CD56, CD14 and CD19 . 
cells using CD8, CD56, CD 14 and CD 19 Miltenyi beads and positive selection. CD45RO 
beads were then used to isolate the CD45RO CD4 lymphocytes with the remaining cells being 
5 CD45RA CD4 lymphocytes. CD45RA CD4, CD45RO CD4 and CD 8 lymphocytes were 
placed in DMEM 5% FCS (Hyclone), 1 OOjjM non essential amino acids (Gibco), ImM 
sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0~fM (Gibco)j and lOmM Hepes (Gibco) and 
plated at 1 0 6 cells/ml onto Falcon 6 well tissue culture plates that had been coated overnight 
with 0.5pg/ml anti-CD28 (Pharmingen) and 3ug/ml anti-CD3 (OKT3, ATCC) in PBS. After 6 

10 and 24 hours, the cells were harvested for RNA preparation. To prepare chronically activated 
CD8 lymphocytes, we activated the isolated CD8 lymphocytes for 4 days on anti-CD28 and 
• anti-CD3 coated plates and then harvested the cells and expanded them in DMEM 5% FCS 
(Hyclone), lOOuM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2. The expanded CD8 

15 cells were then activated again with plate bound anti-CD3 and anti-CD28 for 4 days and 

expanded as before. RNA was isolated 6 and 24 hours after the second activation and after 4 
days of the second expansion culture. The isolated NK cells were cultured in DMEM 5% FCS 
(Hyclone), lOOpM non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), 
mercaptoethanol 5.5x1 0~ 5 M (Gibco), and lOmM Hepes (Gibco) and IL-2 for 4-6 days before 

20 RNA was prepared. 

To obtain B cells, tonsils were procured from NDRJ. The tonsil was cut up with sterile 
dissecting scissors and then passed through a sieve. Tonsil cells were then spun down and 
resupended at 10 6 cells/ml in DMEM 5% FCS (Hyclone), lOO^iM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), and lOmM 

25 Hepes (Gibco). To activate the cells, we used PWM at 5pg/ml or anti-CD40 (Pharmingen) at 
approximately lOug/ml and IL-4 at 5-lOng/ml. Cells were harvested for RNA preparation at 
24,48 and 72 hours. 

To prepare the primary and secondary Thl/Th2 and Trl cells, six-well Falcon plates 
were coated overnight with lOpg/ml anti-CD28 (Pharmingen) and 2pg/ml OKT3 (ATCC) 5 and 
30 then washed twice with PBS. Umbilical cord blood CD4 lymphocytes (Poietic Systems, 

German Town, MD) were cultured at 10 5 -10 6 cel ls/ml in DMEM 5% FCS (Hyclone), lOOpM 
non essential amino acids (Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5.5x10" 
5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (4ng/ml). IL-12 (5ng/ml) and antML4 (lpg/ml) 
were used to direct to Thl, while IL-4 (5ng/ml) and anti-IFN gamma (1 pg/ml) were used to 
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direct to Th2 and IL-10 at 5ng/ml was used to direct to Trl. After 4-5 days, the activated Thl, 
Th2 and Trl lymphocytes were washed once in DMEM and expanded for 4-7 days in DMEM 
5% FCS (Hyclone), 1'OOjuM non 'essential amino acids (Gibco), ImM sodiurn pyruvate 
(Gibco), mercaptoethanol 5.5x1 0" 5 M (Gibco), lOmM Hepes (Gibco) and IL-2 (Ing/ml). 
Following this, the activated Thl , Th2 and Trl lymphocytes were re-stimulated for 5 days 
with anti~CD28/OKT3 and cytokines as described above, but with the addition of anti-CD95L 
OHg/ml) to prevent apoptosis. After 4-5 days, the Thl,Th2 and Trl lymphocytes were' , 
washed and then expanded again with IL-2 for 4-7 days. Activated Thl and Th2 lymphocytes 
were maintained in this way for a rnaxirnum of three cycles. RNA was prepared from primary 
and secondary Thl, Th2 and Trl after 6 and 24 hours following the second and third 
activations with plate bound anti-CD3 and anti-CD28 mAbs and 4 days into the second and 
third expansion cultures in Xnterleukin 2. 1 

The following leukocyte cells lines were obtained from the ATCC: Ramos, EOL-l, 
KU-812. EOL cells were further differentiated by culture in 0.1 mM dbcAMP at 5x 1 0 5 cells/ml 
for 8 days, changing the media every 3 days and adjusting the cell concentration to 
5xl0 5 cells/ml. For the culture of these cells, we used DMEM or RPMI (as recommended by 
the ATCC), with the addition of 5% FCS (Hyclone), 1 OOjiM non essential amino acids 
(Gibco), ImM sodium pyruvate (Gibco), mercaptoethanol 5,5x1 0" 5 M (Gibco), lOmM Hepes 
(Gibco). RNA was either prepared from resting cells or cells activated with PMA at lOng/ml 
and ionomycin at lpg/ml for 6 and 14 hours. Keratinocyte line CCD 106 and an airway 
epithelial tumor line NCI-H292 were also obtained from the ATCC. Both were cultured in 
DMEM 5% FCS (Hyclone), lOOuM non essential amino acids (Gibco), ImM sodium pyruvate 
(Gibco), mercaptoethanol 5.5xl0" 5 M (Gibco), and lOmM Hepes (Gibco). CCD1106 cells were 
activated for 6 and 14 hours with approximately 5 ng/ml TNF alpha and Ing/ml IL-1 beta, 
while NCI-H292 cells were activated for 6 and 14 hours with the following cytokines: 5ng/ml 
IL-4, 5ng/ml IL-9, 5ng/mJ IL-1 3 and 25ng/ml IFN gamma. 

For these cell lines and blood cells, RNA was prepared by lysing approximately 
10 7 cells/m! using Trizol (Gibco BRL). Briefly, 1/10 volume of bromochloropropane 
(Molecular Research Corporation) was added to the RNA sample, vortex ed and after 10 
minutes at room temperature, the tubes were spun at 14,000 rpm in a Sorvall SS34 rotor. The 
aqueous phase was removed and placed in a 15ml Falcon Tube. An equal volume of 
isopropanol was added and left at -20°C overnight. The precipitated RNA was spun down at 
9,000 rpm for 15 min in a Sorvall SS34 rotor and washed in 70% ethanol. The pellet was 
redissolved in 300^1 of RNAse-free water and 35 ul buffer (Promega) 5j.il DTT, 7jil RNAsin 
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and 8uJ DNAse were added. The tube was incubated at 37°C for 30 minutes to remove 
contaminating genomic DNA, extracted once with phenol chloroform and re-precipitated with , 
1/10 volume of 3M sodium acetate and 2 Volumes of 1 00% ethanol. The RNA was spun down 
and placed in RNAse free water. RNA was stored at ~80°C. • 
5 AI comprehensive panel vl.O 

The plates for Al_comprehensive paneljvl.O include two control wells and 89 test 
samples comprised of cDNA isolated from surgical arjd postmortem human tissues obtained 
from the Backus Hospital and Clinomics (Frederick, MD). Total RNA was extracted from 
tissue samples from the Backus Hdspital in the Facility at CuraGen. Total RNA from other 
0 tiskues was obtained from Clinomics.. 

Joint tissues including synovial fluid, synovium, bone and cartilage were obtained from 
patients undergoing total knee or hip replacement surgery at the Backus Hospital. Tissue 
samples were immediately snap frozen in liquid nitrogen to ensure that isolated RNA was of 
optimal quality and not degraded. Additional samples of osteoarthritis and rheumatoid arthritis 
5 joint tissues were obtained from Clinomics. Normal control tissues were supplied by 
Clinomics and were obtained during autopsy of trauma victims. 

Surgical specimens of psoriatic tissues and adjacent matched tissues were provided as 
total RNA by Clinomics. Two male and two female patients were selected between the ages of 
25 and 47. None of the patients were taking prescription drugs at the time samples were 
0 isolated. 

Surgical specimens of diseased colon from patients with ulcerative colitis and Crohns 
disease and adjacent matched tissues were obtained from Clinomics. Bowel tissue from three 
female and three male Crohn's patients between the ages of 41-69 were used. Two patients 
were not on prescription medication while the others were taking dexamethasone, 

5 phenobarbital, or tylenol. Ulcerative colitis tissue was from three male and four female 
patients. Four of the patients were taking lebvid and two were on phenobarbital. 

Total RNA from post mortem lung tissue from trauma victims with no disease or with 
emphysema, asthma or COPD was purchased from Clinomics. Emphysema patients ranged in 
age from 40-70 and all were smokers, this age range was chosen to focus on patients with 

0 cigarette-linked emphysema and to avoid those patients with alpha- lanti -trypsin deficiencies. 
Asthma patients ranged in age from 36-75, and excluded smokers to prevent those patients that 
could also have COPD. COPD patients ranged in age from 35-80 and included both smokers 
and non-smokers. Most patients were taking corticosteroids, and bronchodilators. 
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In the labels employed to identify tissues in the AI_comprehensive paneljyl.O panel, 
the following abbreviations are used: 

AI = Autoimmunity ' 
Syn = Synovial 
5 Normal = No apparent disease 

Rep22 /Rep20 = individual patients ■ 
RA = Rheumatoid arthritis 1 
Backus = From Backus Hospital 
OA - Osteoarthritis 
1 0 1 (SS) (BA) (MF) - Individual patients 

Adj - Adjacent tissue 
Match control = adjacent tissues 
-M = Male 
-F = Female 

1 5 COPD = Chronic obstructive pulmonary disease . 

Panels 5D and 51 

The plates for Panel 5D and 51 include two control wells and a variety of cDNAs 
isolated from human tissues and cell lines with an emphasis on metabolic diseases. Metabolic 
tissues were obtained from patients enrolled in the Gestational Diabetes study. Cells were 
20 obtained during different stages in the differentiation of adipocytes from human mesenchymal 
stem cells. Human pancreatic islets were also obtained. 

In the Gestational Diabetes study subjects are young (18-40 years), otherwise healthy 
women with and without gestational diabetes undergoing routine (elective) Caesarean section. 
After delivery of the infant, when the surgical incisions were being repaired/closed, the 
25 obstetrician removed a small sample (<1 cc) of the exposed metabolic tissues during the 

closure of each surgical level. The biopsy material was rinsed in sterile saline, blotted and fast 
frozen within 5 minutes from the time of removal. The tissue was then flash frozen in liquid 
nitrogen and stored, individually, in sterile screw-top tubes and kept on dry ice for shipment to 
or to be picked up by CuraGen. The metabolic tissues of interest include uterine wall (smooth 
30 muscle), visceral adipose, skeletal muscle (rectus) and subcutaneous adipose. Patient 
descriptions are as follows: 

Patient 2: Diabetic Hispanic, overweight, not on insulin 
Patient 7-9: Nondiabetic Caucasian and obese (BMI>30) 
Patient 10: Diabetic Hispanic, overweight, on insulin 
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Patient 1 1 : Nondiabetic African American and overweight 
Patient 12: Diabetic Hispanic on insulin 

Adipocyte differentiation was induced in donor progenitor cells obtained from Osirus 
(a division of Clonetics/BioWhittaker) in triplicate, except for Donor 3U which had only two 
5 replicates. Scientists at Clonetics isolated, grew and differentiated human mesenchymal stem 
cdls (HuMSCs) for CuraGen based on the published protocol found in Mark F. Pittenger, et 
al., Multilineage Potential of Adult Humari Mesenchymal Stem Cells Science Apr 2 1S*99; 
143-147. Clonetics provided Trizol lysates or frozen pellets suitable for mRNA isolation and 
ds cDNA production. A .general description of each donor is as follows: 
10 Donor 2 and 3 U: Mesenchymal Stem cells, Undifferentiated Adipose 

Donor 2 and 3 AM: Adipose, Adipose Midway Differentiated 

Donor 2 and 3 AD: Adipose, Adipose Differentiated 

Human cell lines were generally obtained from ATCC (American Type Culture 
Collection), NCI or the German tumor cell bank and fall into the following tissue groups: 
15 kidney proximal convoluted tubule, uterine smooth muscle cells, small intestine, liver HepG2 
cancer cells, heart primary stromal cells, and adrenal cortical adenoma cells. These cells are all 
cultured under standard recommended conditions and RNA extracted using the standard 
procedures. All samples were processed at CuraGen to produce single stranded cDNA. 

Panel 51 contains all samples previously described with the addition of pancreatic islets 
20 from a 58 year old female patient obtained from the Diabetes Research Institute at the 

University of Miami School of Medicine. Islet tissue was processed to total RNA at an outside 
source and delivered to CuraGen for addition to panel 51. 

In the labels employed to identify tissues in the 5D and 51 panels, the following 
abbreviations are used: 
25 GO Adipose = Greater Omentum Adipose 

SK = Skeletal Muscle 

UT = Uterus 

PL = Placenta 

AD = Adipose Differentiated 
30 AM = Adipose Midway Differentiated 

U = Undifferentiated Stem Cells 
Panel CNSD.01 

The plates for Panel CNSD.01 include two control wells and 94 test samples 
comprised of cDNA isolated from postmortem humari brain tissue obtained from the Harvard 
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Brain Tissue Resource Center. Brains are removed from calvaria of donors between 4 and 24 
hours after death, sectioned by neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. 
All brains are sectioned and examined by neuropathologists to confirm diagnoses with clear 
associated neuropathology. 
5 Disease diagnoses are taken from patient records. The panel contains two brains from 

each of the following diagnoses: Alzheimer's disease, Parkinson's disease, Huntington's 
disease, Progressive Supernuclear Palsy, Depression, and "Normal controls". Within each ;of 
these brains, the following regions are represented: cingulate gyrus, temporal pole, globus 
palladus, substantia nigra, Brodman Area 4 (primary motor strip), Brodman Area 7 (parietal 

10 cortex), Brodman Area 9 (prefrontal cortex), and Brodman area 17 (occipital cortex). Not all 
brain regions are represented in al] cases; e.g., Huntington's disease is characterized in part by 
neurodegeneration in the globus palladus, thus this region is impossible to obtain from 
confirmed Huntington's cases. Likewise Parkinson's disease is characterized by degeneration 
of the substantia nigra making this region more difficult to obtain. Normal control brains were 

1 5 examined for neuropathology and found to.be free of any pathology consistent with 
neurodegeneration. 

In the labels employed to identify tissues in the CNS panel, the following abbreviations 
are used: 

PSP = Progressive supranuclear palsy 
20 Sub Nigra = Substantia nigra 

Glob Palladus- Globus palladus 
Temp Pole = Temporal pole 
Cing Gyr = Cingulate gyrus 
B A 4 = Brodman Area 4 
25 Panel CNS Neurodegeneration Vl.O 

The plates for Panel CNS_Neurodegeneration_V1.0 include two control wells and 47 
test samples comprised of cDNA isolated from postmortem human brain tissue obtained from 
the Harvard Brain Tissue Resource Center (McLean Hospital) and the Human Brain and 
Spinal Fluid Resource Center (VA Greater Los Angeles Healthcare System). Brains are 
30 removed from calvaria of donors between 4 and 24 hours after death, sectioned by 

neuroanatomists, and frozen at -80°C in liquid nitrogen vapor. All brains are sectioned and 
examined by neuropathologists to confirm diagnoses with clear associated neuropathology. 

Disease diagnoses are taken from patient records. The panel contains six brains from 
Alzheimer's disease (AD) patients, and eight brains from "Normal controls" who showed no 
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evidence of dementia prior to death. The eight normal control brains are divided into two 
categories: Controls with no dementia and no Alzheimer's like pathology (Controls) and 
controls with no dementia but evidence of severe Alzheimer's like pathology, (specifically 
senile plaque load rated as level 3 on a scale of 0-3; 0 = no evidence of plaques, 3 — severe AD 
5 senile plaque load). Within each of these brains, the following regions are represented: 
hippocampus, temporal cortex (Brodman Area 21), parietal cortex (Brodman area 7), and 
occipital cortex (Brodman area 17). These regions \Vere chosen to encompass all levels of 
neurodegeneration in AD. The hippocampus is a region of early and severe neuronal loss in 
AD; the temporal cortex is known to show neurodegeneration in AD after the hippocampus; 

10 the parietal cortex shows moderate neuronal death in the late stages of the disease; the 

occipital cortex is spared in AD and therefore acts as a "control" region within AD patients. 
Not all brain regions are represented in all cases. 

In the labels employed to identify tissues in the CNS_Neurodegeneration_V1.0 panel, 
the following abbreviations are used: 

15 AD = Alzheimer's disease brain; patient was demented and showed AD-like pathology 

upon autopsy 

Control = Control brains; patient not demented, showing no neuropathology 
Control (Path) = Control brains; pateint not demented but showing sever AD-like 
pathology 

20 SupTemporal Ctx = Superior Temporal Cortex 

Inf Temporal Ctx = Inferior Temporal Cortex 

A.NOVl:TEN-M4-like 

Expression of the NOV! gene (CG5 609 1-01) was assessed using the primer-probe sets 
25 Ag2581, Ag2910 and Agl479 ? described in Tables 20-22. Results of the RTQ-PCR runs are 
shown in Tables 23-27. 
Table 20 . Probe Name Ag2581 



Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 ' - tgaccacagacatcatcagtgt-3 ' (SEQ ID NO;190) 


22 


7821 


Probe 


TET-5 » -ccatcttgaaccatgcccactaccta-3 1 -TAMRA (SEQ 
ID NO; 191} 


26 


7872 


Reverse, 


5 » ~tcaatggtgaagtgcaggtt-3 » (SEQ ID NO: 192) 


20 


7901 


Table 21 . Probe Name Ag2910 


Primers, 


Sequences 


Length 


Start 
Position 
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Forwardjs ' -tgaccacagacatcatcagtgt-3 ' (SEQ ID NO: 193) 


1 » 


7821 


Probe 


TET-5 r -ccatatetgaa'ccatgcccactaccta-.B ' -TAMRA (SEfc 
ID NO: 194) 


26 


7872 


Reverse^' -tcaatggtgaagtgcaggtt-3' {SEQ ID NO': 195) 


1 '26 | 


7901 


Table 22. Probe Name Agl479 


Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 ' -cacggaacgtatcttcaagaaa-3 ' (SEQ ID NO:196) 


22 ! 


2108 


Probe 


TET-5 • -ctgcacgtgtgaccctaactggactg-3 ' tTAMRA (SEQ , 
ID NO:197) ' 


26 ' 


2137; 


Reverse 


5 • -gccacagtccacagaacatatt-3 » (SEQ ID NO:198) 


22 


2182 



Table 23 . CNS^neurodegeneration^vl.O 



Tissue Name 


Rel. Exp.(%) 
Ag258l,Run 
208777162 


Rel. £xp.(%) 
Ag29J0,Run. 
209735201 


Tissue Name 


Rel. Exp.(%) 
Ag2581, Run 
208777162 


Rel. Exp.(%) 
Ag2910,Run 
209735201 


AD 1 Hippo 


8.8 


11.0 


Control (Path) 
3 Temporal Ctx 


'1.5 


2.1 


AD 2 Hippo 


28.5 


i ' 26.4 j 


Control (Path) j 
4 Temporal Ctx 


27.7 


25.2 


AD 3 Hippo 




6,1 


AD 1 Occipital 
Ctx 


13.4 


13.2 


AD 4 Hippo 


8.5 


7.1 


AD 2 Occipital 
Ctx (Missing) 


0.0 


0.0 


AD 5 Hippo 


94.0 


100.0 


AD 3 Occipital 
Ctx 


1.7 


3.7 


AD 6 Hippo 


67.8 


66.9 


AD 4 Occipital 
Ctx 


31.0 


14.3 


Control 2 
Hippo 


42.6 


45.1 


AD 5 Occipital 
Ctx 


57.0 


55.9 


Control 4 
Hippo 


9.7 


11.0 


AD 6 Occipital 

Ctx , . 


16.2 


15.8 


Control (Path) 
3 Hippo 


3.8 


2.6 


Control 1 
Occipital Ctx ! 


1.4 


1.0 


AD 1 

Temporal Ctx ; 


9.3 


11.8 


Control 2 
Occipital Ctx 


72.7 


69.7 


AD 2 

Temporal Ctx 


26.8 


27.0 


Control 3 
Occipital Ctx 


16.0 


13.2 


AD 3 

Temporal Ctx 


5.0 


4.0 


Control 4 
Occipital Ctx 


5.4 : 


6.8 


AD 4 

Temporal Ctx 


22,8 


24.1 


Control (Path) 
1 Occipital Ctx 


93.3 i 


95.9 


AD 5 Inf 
Temporal Ctx 


100.0 


94.6 


Control (Path) ' 
2 Occipital Ctx : 


8.6 


9.4 


AD 5 Sup 
Temporal Ctx 


34.2 


36.9 


Control (Path) 
3 Occipital Ctx ! 


0.9 


1.1 


AD 6 Inf 
Temporal Ctx 


47.3 


53.2 


Control (Path) 
4 Occipital Ctx : 


17.1 


15.2 


AD 6 Sup 
Temporal Ctx 


47.6 


40.9 


Control 1 
Parietal Ctx 


2.1 


5.1 


Control 1 
Temporal Ctx 


2.4 


1.9 


Control 2 
Parietal Ctx 


35.6 


44.4 
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Control 2 
Temporal Ctx 


44.8 


44.8 


Control 3 
ParietaJ Ctx 


17.8 


14.6 


Control 3 
Temporal Ctx 


10.4 


in ■ 


Control (Patn) . 
1 Pariefal Ctx 


78.5 


74.2 


Control 3 , 
Temporal Ctx 


8.2 


7.5 


Control (Path) ; 
2 Parietal Ctx 


19.5 


21.8* 


Control (Path) 
1 Temporal 
Ctx 


80.1 


68.3 


Control (Path) 
3 Parietal Ctx 


1.1 


2.0 


Control (Path) 
2 Temporal 
Ctx 


36.6 


29.3 


Control (Path) : 
4 Parietal Ctx 


43.2 


37.9 



Table 24. Panel 1.3D 



1 

Tissue Name 


Rel. 
Exp.(%) 

a „.i Ana 
Agl47y, 

Run 
165520101 


Rel. 

JExp.(%) 

Run 

162292620 


Rel. 
Exp.(%) 

Run 
162556486 


TiceiiA n tno 
M. JSMlC 1 l#J Jiie 


Rel. j 
Exp.(%) 

Run 
165520101 


Rel. ( 
Exp.(%) j 

Run 

162292620 


Rel. , 
Exp.(%) 
A o 29l0 
Run 
162556486 


Liver 

adenocarcinoma ; 


16.0 


0.0 


0.0 


Kidney 
(fetal) 


2.8 


5.8 | 


4.7 


Pancreas 


0.5 


0.2 


0.0 


786-0 


11.2 ; 


1.7 | 


0.1 


Pancreatic ca. j 


16.2 


0.5 


0.0 


Renal ca. 
A498 1 


13.1 ! 


0.8 


0.9 


Adrenal gland j 


4.1 


0.3 1 


0.4 


Renal ca. 
RXF393 


21.5 j 


8.8 


4.7 


Thyroid j 


2.0 


5.4 


5.3 


Renal ca. i 
ACHN 


10.1 


4.0 


5.0 


Salivary gland 


0.2 


0.5 


0.7 


Renal ca. 

UO-31 ; 


10.2 | 


13.7 


13.9 


Pituitary gland j 


3.5 


11.1 


8.1 


Renal ca. 
TK-10 


0.0 j 


2.9 j 


3.0 


Brain (fetal) 


8.7 


6.6 


11.7 


Liver 


0.0 


0.0 


0.0 


Brain (whole) j 


10.4 


10.9 


7.2 


Liver (fetal) 


o.i : 


0.0 


0.0 


Brain (amygdala) \ 


12.8 


14.9 


12.9 


Liver ca. 

(hepatoblast) 

HepG2 


0.2 ; 


0.4 ! 


0.0 


Brain ■ 
(cerebellum) 


10.0 


2.6 


2.0 


Lung 


0.4 


0.7 


0.2 


Brain 

(hippocampus) 


17.7 


13.5 


12.3 


Lung (fetal) 


0.3 


0.7 


1.9 


Brain (substantia 
nigra) 


1.8 


1.5 


0.7 


Lung ca. 
(small cell) 
LX-1 


0.0 


0.0 


0,0 


Brain (thalamus) 


19.3 


12.2 


7.3 


Lung ca. 
(small cell) 
NCI -H 69 


3.1 


13.8 


9.9 


Cerebral Cortex 


8.0 


100.0 


68.8 


Lung ca. 
(s.cell var.) 
SHP-77 


2.4 


1.7 


2.2 


Spinal cord 


1.4 


13.0 


10.2 


Lung ca. 
(large 
celI)NCI~ 
H460 


18.6 


0.0 


0.0 
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glio/astro U87- 
MG 


17 A 

lo.o 


1A f 




jLung ca. 

KI2UXr*MIjL t-Cily 

JA549 


0 4 


0.0 


0 0 


glio/astro U-118- 
MG 


90 A 


0 0 


0 9 


jLung ca. 
jNCI-H23" 


i.4 


0.3 


0.0 


astrocytoma 
SW1783 


01 o 


9 A 


9 R 


JLung ca. 

If Ti/Yn c /"■Alii 
H IHJIl-o J 

JhOP-62 


9 5 


0.1 


0.6 


neuro*; met SK- 
N-AS ! 


31.2 


4.0 


3.8 


IT 11 V* fr r-o 

(npn-s.cl) 
JNCI-H522 


' 28.1 


0.0 


0.0 


astrocytoma SF- 1 
539 

j .. .. 


25.2 


0J2 


0.0 


Lung ca. 
(squanx) SW 
|900 


0.6 


2.2 


2.8 


astrocytoma SNB-: 
75 ; 


20.6 


0.8 


2.5 


Lung ca. 
(squam.) 
(NCI-H596 


16.5 


6.0 


4.6 


glioma SNB-19 j 


4.7 


15.0 


12.2 


(Mammary 
jgland 


0.7 ; 


1.9 j 


2.2 


t>1inrna TI251 1 


100.0 


5.7 


5.9 


iBreast ca.* 
(pi efl MCF- 

}7 


5.0 


0.3 i 


1.4 


glioma SF-295 


5.6 


1.3 


1.5 


jBreast ca.* ; 
(pl.ef) MDA- i 
(MB-231 


2.4 j 


0.0 


0.0 


( 

Heart (tetal) ' 


1.0 


1.3 


1.2 


Breast ca.* 
((pl.ef) T47D : 


JJ.O ; 


A A 


A A 
\}.\) 


Heart 


0.1 


A c 




jBreast ca. 
jBT-549 


a n 


A 9 i 


0 0 


Skeletal muscle 
(fetal) I 


1 A 
1 .1) 


ao a 


JO. 1 


(Breast ca. 
jMDA-N j 


A 0 • 
U.o 


a n i 


0 0 


Skeletal muscle 


6.0 


0.8 


0.6 


jOvary | 


0.8 j 


100.0 ! 


100.0 


Bone marrow | 


0.0 


0.2 i 


0.7 


jOvarian ca. > 
JOVCAR-3 


58.6 


0.0 


0.7 


Thymus 


0.2 


8.7 


3.7 


Ovarian ca. 
(oVCAR-4 


2.4 


0.0 


0.0 


Spleen j 


o n 


A A 

u.u 


A 0 


(Ovarian ca. 1 
joVCAR-5 




v/.O i 


9 6 


Lymph node 


2.0 


0.2 


0.5 


jOvarian ca. 
joVCAR-8 


8.7 \ 


1,7 : 


0.5 


Colorectal j 


0.3 


3.0 


2.0 


Ovarian ca. 
jlGROV^l 


3.1 ; 


0.0 


0.1 


Stomach 


3.4 


0.2 


0.5 


(Ovarian ca.* 1 
(ascites) SK- 
|OV-3 


27.9 


0.0 


0.0 


Small intestine 


3.5 


0.0 


0.1 


(Uterus 


2.4 1 


1.1 


1.2 


Colon ca. SW480 : 


1.6 


0.0 


0.0 


(placenta 


8.1 


o.2 : 


o.o 


Colon ca.* 

SW620(SW480 

met) 


0.0 


0.0 


0.2 


Jprostate 


2.1 


0.2 


1.0 


Colon ca. HT29 

i 


0.7 


0.0 


0.0 


[Prostate ca.* 
(bone 
jmet)PC-3 


0.7 | 


27.0 


19.2 
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Colon ca. HCT- • 
116 


0.3 


0.0 


0.0 


Testis 


4.5 


1.9 

• ' i 


2.5 


Colon ca. CaCo-2 


8.6 


1.3 

i 


0.3 


Melanoma 
Hs688(A).T 


10.0 


,1.6 , 


2.2 


Colon ca. 1 
tissue(OD03866) i 


2,6 


6.1 


3.7 


Melanoma* 1 
(met) 

tt^ / no /T^fc \ HP I 

Hs688(B).T 


12.5 


0.9 


2.0 



Colon ca. HCC- 
2998 ' 


,0 


0.0 


0.0 


Melanoma , 
UACC-62 , j 


1.2 


0.7 


0.3 


Gastric ca.* (liver 
met) NCI-N87 ! 


0.9 


3.3 
I 


3.7 


Melanoma 
M14 ' ; 


13.7 . 


0.0 


1 0.0 


Bladder 


0.9 


1.9 


2.1 


Melanoma 
LOX IMVI 


1.2 


1.3 ! 


1.4 


- — r 

Trachea - 


0.8 


i 

5.1 


6.1 


Melanoma* j 
(met) SK- 
MEL -5 


3.7 


0.0 j 


0.0 


Kidney ; 


0.8 


3.7 


3.0 


Adipose 


3.6 : . 


2.2 


1'.9 



Table 25 . Panel 2D 



Tissue Name | 


Rel. Exp.(%) 
Ag2581,Run , 
161921268 


ReL Exp.(%) 
'Ag29lO,Run 
162354453 


Tissue Name 


Rel. Exp.(%) 
Ag2581,Run 
1 161921268 ' 


Rel. Exp.(%) 
Ag2910,Run 
162354453 


Normal Colon 


13.2 1 


7.1 


Kidney Margin 
8120608 


3.8 


2.6 


CC Well to Mod j 
Diff (OD03866) 


6.5 


11.0 


Kidney Cancer 
8120613 


0.9 


0.7 


CC Margin 
(OD03866) 


. 2.7 


2.0 


Kidney Margin 
8120614 


7.5 ; 


4.4 


CC Gr.2 

rectosigmoid 

(OD03868) 


1.6 


1.0 


Kidney Cancer 
9010320 


18.4 .i 


22.4 


CC Margin 
(OD03868) 


1.0 


2.1 


Kidney Margin 
9010321 


9.9 


15.9 


CC Mod Diff 
(ODO3920) 


0.5 


1.5 


Normal Uterus 


2.4 


4.9 


CC Margin 
(ODO3920) 


1.4 


5.0 


Uterus Cancer 
064011 


6.8 


8.7 


CC Gr.2 ascend ■ 
colon (OD03921) \ 


5.3 ; 


11.6 


Normal Thyroid 


19.1 


29.5 


CC Margin 
(OD03921) 


0.9 


0.3 


Thyroid Cancer 
064010 


52.9 


75.8 


CC from Partial 
Uepatectomy 
(ODO4309) Mets 


4.2 ; 


1.7 


Thyroid Cancer 
A302152 


3.9 


6.7 


Liver Margin j 
(ODO4309) : 


0.7 


0.4 


Thyroid Margin 
A302153 


31.9 


35.4 


Colon mets to lung ! 
(OD04451-01) 


2.9 


3.1 


Normal Breast 


6.1 


12.2 


Lung Margin 
(OV0A45U02) 


0.8 


4.2 


Breast Cancer 
(OD04566) 


4.1 


4.5 


Normal Prostate 
6546-1 


0.7 


18.7 


Breast Cancer 
(OD045 90-01) 


2.7 


14.3 


Prostate Cancer 
(OD04410) 


6.8 


8.8 


Breast Cancer 
Mets (OD04590- 


21.0 


21.0 
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03) 






Prostate Margin 
(OD04410) 


3.4 

* 


5.7 


Breast Cancer 
Metastasis ■ 
(OD04655-05) ; 


3.4 • ; 


5.3 


Prostate Cancer , 
(OD04720-01) 


10.3 


12.7 i 


Breast Cancer 
064006 


9.7 1 


26.2 


Prostate Margin 
(OD04720-02) 


7.4 


16.2 


Breast Cancer 
1024 


11.3 i 


15.3 


Norma) Lung 
061010 


5.8 


7.2 i 


Breast Cancer 
9100266 


4.9 i 


12.2 


Lung Met to Muscle j 
(OD04286) 


1.8 


3.5 


Breast Margin 
9100265 


10.5 


16.8' 


Muscle Margin 
(OD04286) 


6.8 i 


5.8 


Breast Cancer 
A209073 


17.0 I 


32.3 


Lung Malignant 
Cancer (OD03 126) ] 


20.9 


19.9 


Breast Margin 
A2090734 


6.9 


8.2 


Lung Margin 
(OD03126) 


4.7 


4.9 j 


Normal Liver 


0.0 


0.3 


Lung Cancer 
(OD04404) 


22.8 


22.4 


Liver Cancer 
064003 


0.0 


0.0 


Lung Margin 
(OD04404) 


5.0 ' ; 


4.1 j 


Liver Cancer 
1025 


03 i 


0.7 


Lung Cancer 
(OD04565) 


13.2 


14.6 ] 


Liver Cancer 
1026 


0.7 i 


0.9 


Lung Margin 
(OD04565) 


0.7 


0.6 


Liver Cancer 
6004-T 


0.3 


0.9 


Lung Cancer 
(OD04237-01) 


37.6 


57.8 


Liver Tissue 
6004-N 


0.0 


0.8 


Lung Margin 
(OD04237-02) 


2.4 ; 


1.3 


Liver Cancer 
6005-T 


0.5 


2.1 


Ocular Mel Met to 
Liver (OD043 10) ; 


0.0 


0.3 


Liver Tissue 
6005-N 


0.4 


0.8 


Liver Margin 
(ODO4310) 


0.0 


0.0 


Normal Bladder \ 


6.8 


8.1 


Melanoma Mets to 
Lung (OD04321) 


0.8 


1.7 


1023 


6.7 ; 


8.0 


Lung Margin 


1.9 


4.7 


Bladder Cancer 
A302173 


42.3 


46.3 


Normal Kidney 


21.6 


20.4 


Bladder Cancer 
(OD04718-01) 


2.8 


4.2 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


1 9 


5.0 


Bladder Nonnal 
Adjacent 
(OD047 18-03) 


6.0 


10.2 


Kidney Margin 
(OD04338) 


15.0 


18.2 


Normal Ovary 






Kidney Ca Nuclear 
grade 1/2 
(OD04339) 


1.5 


3.1 


Ovarian Cancer 
064008 


100.0 


100.0 


Kidney Margin 
(OD04339) 


13.7 


20.9 


Ovarian Cancer 
(OD04768-07) 


1.1 


0.6 


Kidney Ca, Clear 
cell type (OD04340) 


4.0 


6.5 


Ovary Margin 
(OD04768-08) 


3.4 


8.5 


Kidney Margin 
(OD04340) 


8.2 


13.1 


Normal Stomach 


5.2 


2.8 
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Kidney Ca, Nuclear 
grade 3 (OD04348) 


1.3 


2.0 


Gastric Cancer 
9060358 


3.4 


5.6 


Kidney Margin 
(OD04348) 


7.3 


14.3 


Stomach Margin 
9060359 


2.0 


2.2 . 


Kidney Cancer 
(OD04622-01) 


15.4 


20.0 


Gastric Cancer 
9060395 ' 


8.3 


17.0 


Kidney Margin 
(OD04622-03) 


1.9 


4.0 


Stomach Margin 
9060394 


6.2 


5.2 


Kidney Cancer 
(OD04450-01) 


0.0 


2.6' 


Gastric Cancer 
9060397 


8.2 


11.6 


Kidney Margin 
(OD04450-03) 


10.5 


9.5 


Stomach Margin 
9060396 


0.9 


0.3 


Kidney Cancer 
8120607 


9.2 


15.4 | 


Gastric Cancer 
064005 


3.8 


9.2 



Table 26 . Panel 3D 



1 ISJiUt i^jnic 


Re!. Exp.(%) 

AolCOl Pun 

164827572 


TWcsiip ^amp 
i j>suc i>ajjjc 


Rel. Exp,(%) 
164827572 


Daoy- Medulloblastoma 


2.3 


carcinoma (metastasis) 


0.5 






PQ_?_ Ovftrinn rlpar pp11 parfinomfl 


1.2 


D283 Med- Medulloblastoma 


0.4 


Ramos- Stimulated with j 


0.0 


PFSK-1- Primitive 

MpnrriPPtPiflprtYifll 


11.3 


Ramos- Stimulated with 

PA/T A /inn ATYivrin 1 A\\ 

1 XvXJrVi lvJilUill y \jxXi 1 *-r±J 


0.0 


XF-498- CNS 


0.7 


MEG-0 1 - Chronic myelogenous 
leukemia (megokaryoblast) 


0.0 


SNB-78- Glioma 


0.0 


Raji- Burkitt's lymphoma j 


0.3 


SF-268- Glioblastoma 


5.1 


Daudi- Burkitt's lymphoma i 


0.1 


T98G- Glioblastoma 


0.4 


U266- B-cell plasmacytoma 


0.1 


SK-N-SH- Neuroblastoma 
(metastasis) 


20.9 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 ' 


RL- nori-Hodgkin's B-cell 
lymphoma 


0.7 


Cerebellum 


2.3 


JM1- pre-B-cell lymphoma I 


0.0 


Cerebellum 


2.2 


Jurkat- T cell leukemia 


0.4 


NCI-H292-Mucoepidermoid 
lung carcinoma 


1.3 


TF-1- Erythroleukemia j 


0.4 


DMS-114- Small cell lung 
cancer 


0.0 


HUT 78- T-cell lymphoma 


0.3 


DMS-79- Small cell lung 
cancer 


4.3 


U937- Histiocytic lymphoma 


0.3 


NCI-H146- Small cell lung 
cancer 


6.6 


KU-812- Myelogenous leukemia 


0.0 


NCL-H526- Small cell lung 
cancer 


100.0 


769-P- Clear cell renal carcinoma 


1.0 


NCI-N417- Small cell lung 
cancer 


1.8 


Caki-2~ Clear cell renal carcinoma . 


0.5 


NCI-H82- Small cell lung 
cancer 


0.3 


SW 839- Clear cell renal carcinoma; 


3.5 


NCI-H157- Squamous cell 
lung cancer (metastasis) 


0.3 


G401- Wilms' tumor 


7.3 
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NCI-H 1155- Large cell lung : 
cancer 


1.1 


Hs766T- Pancreatic carcinoma (LN: 
metastasis) 


4.3 


NCI-HI 299- Large cell lung 
cancer 


0.6 ■ ' 


CAPAN-1- Pancreatic \ 
adenocarcinoma (liver metastasis) 


0.0 


NCI-H727- Lung carcinoid 


6.2 


SU86.86- Pancreatic carcinoma 
(liver metastasis) 


0.8 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


2.8 


LX-1- Small cell lung cancer 


o.o '•' 


HPAC- Pancreatic adenocarcinoma 1 


0,0 


^UiO-Zv/J- C-Uiun CalJCCI 


0.0 


MIA PaCa-2- Pancreatic carcinoma 


0.0 


KM12- Colon cancer 


jCFPAC-l- Pancreatic ductal 
' (adenocarcinoma 


, 0.0 


KM20L2- Colon cancer 

{ - 


' 0.0 


PANC- 1 - Pancreatic epithelioid 
ductal carcinoma 


0.0 

. ..... 1 


NCI-H716- Colon cancer 


0.9 


T24- Bladder carcinma (transitional; 
cell) _J 


3.1 


SW-48- Colon 
adenocarcinoma 


0.0 


5637- Bladder carcinoma J 


1.0 


SW111 6- Colon 
adenocarcinoma 


; 


HT- 1197- Bladder carcmoma 


1.3 


LS174T- Colon 
adenocarcinoma 


jUM-UC-3- Bladder carcinma : 
.((transitional cell) 


1.3 


SW-948- Colon 
adenocarcinoma 


0.0 | 


A204- Rhabdomyosarcoma \ 


0.3 


SW-480- Colon 
adenocarcinoma 


o.i : 


HT- 1 080- Fibrosarcoma 


12.4 


NCLSNU-5- Gastric 
carcinoma 


0.0 


MG-63- Osteosarcoma s 


0.2 


KATO III- Gastric carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 


9.5 


NCI-SNU- 16- Gastric 
carcinoma 


0.2 


SJRH30- Rhabdomyosarcoma (met 
to bone marrow) 


0.8 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


0.4 


RIM- Gastric 

aUcIiUi^aruJJlUJtJ la 


0.0 


WM266-4- Melanoma 


1.8 


RF-48- Gastric 
adenocarcinoma 


0.0 


DU 145- Prostate carcinoma (brain 
metastasis) 


0.0 


MKN-45- Gastric carcinoma 


0.5 


MDA-MB-468- Breast 
adenocarcinoma 


0.0 


NCJ-N87- Gastric carcinoma 


0.6 


[SCC-4- Squamous cell carcinoma 
[of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.2 


\SCC-9- Squamous cell carcinoma 
[of tongue 


0.0 


RL95-2- Uterine carcinoma 


0.6 


[SCC-15- Squamous cell carcinoma 
|of tongue 


0.5 


HelaS3- Cervical 
adenocarcinoma 


0.2 


(CAL 27- Squamous cell carcinoma 
[of tongue 





Table 27. Panel 4D 





Rel. 


Rel. 


Rel. 




Rel. I 


Rel. 


Rel. 


Tissue Name 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Tissue Name 


Exp.(%) 


Exp.(%) 


Exp.(%) 


Agl479, : 


Ag2581, 


Ag29l0, 


Agl479, 


Ag2581, 


Ag2910, 




Run 


Run 


Run 




Run 


Run 


Run 
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162599612 


164036199 


159079044 




162599612 


1640361991 


159079044 




V.J 


0.0 


0.2 


HUVEC IL- 
Ibeta 


8.1 


0.0 


0.0 


Secondary Th2 act 


0.0 


1 

0.0 


0.0. . 


HUVEC TFN 
gamma 


14.8 


p ■■■ ■ 
0.0 . 


0.0 


Secondary Trl act 


0.0 


0.0 


0.6 


HUVEC TNF ! 
alpha + IFN 
gamma 


8.1 


0.0 1 


0.0 


Secondary Thl 
rest 


0.0 


0.0 


0.0 


HUVEC TNF 
alpha + IL4 


12.0 


O.U ! 


A A 


Secondary Th2 
rest 


0.0 


0.0 | 


0.0 


HUVEC IL-l l 


8.5 


A c\ 
U.O 


A. *A 


Secondary Trl rest 


0.0 , : 


o.o . 


0.0 


Lung 

Microvascular 
EC none 


11.1 


0.0 ' 


0.0 


Primary Thl act 


0.0 


0.0 


0.0 


Lung 

Microvascular 
EC TNFalpha + 
IL-l beta 


9.3 
i 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0, , 


0.0 


Microvascular 
Dermal EC none 


100.0 


0.0 


0.5 


Primary Trl act 


0.0 


0.0 


0.0 


Microsvasular 
Dermal EC 
TNFalpha + IL- 
lbeta 


• 29.7 


t 

0.0 


0.6 


Primary Thl rest 


0.0 


0.0 


0.0 \ 


Bronchial 
epithelium 
TNFalpha + i 
ILlbeta 


0.2 


0.2 


21.8 


Primary Th2 rest 


0.0 1 


0.0 : 


0.0 ; 


Small airway 
epithelium none 


2.2 


0.6 


4.4 


Primary Trl rest j 


0.0 


0.0 


0.0 


Small airway 
epithelium 
TNFalpha + IL- 
lbeta 


0.3 


0.5 j 


4.7 


CD45RA CD4 
lymphocyte act 


1.8 


0.1 


0.2 


Coronery artery 
SMC rest 


8.3 


0.0 


2.4 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


0.3 


Coronery artery 
SMC TNFalpha 
+ IL-l beta 


4.6 


0.0 


0.3 


CD 8 lymphocyte 
act 


0.0 


0.0 


0.0 


Astrocytes rest 


85.9 


2.9 


19.3 


Secondary CD 8 
lymphocyte rest 


0.0 


0.0 


0.0 


Astrocytes 
TNFalpha + IL- 
lbeta 


59.0 


3.9 


17.0 


Secondary CD 8 
lymphocyte act 


0.0 


0.0 


0.0 


KU-812 
(Basophil) rest 


0.0 


0.0 


0.3 


CD4 lymphocyte 
none 


0.0 


0.0 


0.0 


KU-812 

(Basophil) 

PMA/ionomycin 


0.0 


0.0 


0.0 


2ry 

Thl/Th2/Trl anri- 
CD95 CH11 


0.0 


0.0 


0.0 


CCD 11 06 

(Keratinocytes) 

none 


19.8 


0.7 


4.8 


LAK cells rest 


0.0 


0.0 


0.6 


CCD 11 06 

(Keratinocytes) 


1.7 


0.0 


1.7 
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TNFalpha + IL- 
lbeta • 








LAK cells IL-2 


0.0 


0.0 


0.0 


Liver cirrhosis 


0.0 


0.1 


1.7 


LAK cells IL- 
2+IL-12 ' 


0.0 


U.O 


0.0 


Lupus kidney 


' 1 c 
l.o 


A 1 

U.l 


A 1 

u. / 


LAK cells IL- 
2+IFN gamma 


0.0 


1 00.0 


0.0 


NL-l-HZyz none 


A A 

u.u 


A 1 

U.I 


A < 

u.o 


LAK cells IL-2+ 
IL-18 


0.0 


0.0 


0.2 


NKs\-H2.y2 IL-4 


A A 

U.U 


A A 
U.U 


A A 

U.o 


LAK cells 
PMA/ionomycin 


0.0 


0.0 


0.0 


XT/IT TT<ini TT A 1 

NCI-H2y2 IL-y ' 


A A 


A 1 

U.i 




NK Cells IL-2 rest 


0.0 


0.0 


0.0 


NCI-H292IL-13 


0.0 


0.1 ■ 


0.1 


Twm Wav M7 "R 3 
j vvu way lvxjujx j 

day 


0.0 


o.b 


0.0 


NCI-H292 TFN 
gamma 


0.0 ; 


0.2 


0.0 

, u. 


i — - — ■■ — — 

l WO VV dy iVAJUJtv J» 

day 


0.0 


0.0 


0.0 


HPAEC none 


15.1 


0.0 


0.0 


i wo vv dy jvii^j^ / 
day 


0.0 


0.0 


0.3 


HP AFP TNF 
alpha + IL-1 beta 


6.2 ■ 


0.0 


0.0 


PBMC rest 


0.0 


0.0 


I 

0.0 j 


T imo TiTiTfVr\1iiCt 
JUUil^ IlUiUUldaL 

none 


0.9 


5.8 


51.1 


PBMC PWM 


0.0 


0.0 ; 


0.6 ] : 


T nnfi "fi V\vr\t*t 1 n c f 
J-zUUg HUIUUJaol 

TNF alpha + IL- ■ 
1 beta 


0.6 


1.2 


13.0 


PBMC PHA-L 


0.0 


0.0 


j 

o.o i 


T linn ■F5V»T"r\V»1«lci' 
JL.UJlg llUl UDIdM 

IL-4 


2.1 i 


8.3 j 


82.9 


Ramos (B cell) 
none 


0.0 


0.0 


0.0 | 


Lung fibroblast 
IL-9 


1-2 1 


6.7 


50.7 


Ramos (B cell) 
ionomycin ■ 


0.0 


0.0 


0.0 | 


Lung fibroblast 
IL-1 3 


1.2 


6.3 


67.4 


B lymphocytes 
PWM 


0.0 


0.0 


0.0 j 


Lung fibroblast 
IFN gamma 


2.1 


8.4 ' 


100.0 


B lymphocytes 
CD40L and IL-4 \ 


0.2 


O.l 


o.o j 


Dermal 
fibroblast 
CCD1070 rest j 


10.5 ! 


0.5 ! 


8.4 


EOL-l dbcAMP 


0.2 


0.0 


o.o j 


Dermal 

fibroblast 

CCD 1070 TNF ; 

alpha 


n.6 


0.3 


7.3 


POT - I HhcAMP 
PMA/ionomycin 


O.l 


0.0 


jDermal 
' fibroblast 
U ' U CCD1070IL-1 > 

jbeta 


4.9 | 


1 0.3 


2.0 


none 


0.0 


0.0 


0.6 j 


Dermal 

fibroblast IFN j 
gamma ; 


1.2 


0.1 


1.1 


Dendritic cells 
LPS 


0.0 


0.0 


0.0 | 


Dermal 
fibroblast IL-4 


28.3 


0.3 


11.7 


Dendritic cells 
anti-CD40 


0.0 


0.0 


0.0 j 


IBD Colitis 2 


0.7 I 


0.2 


0.6 


Monocytes rest 


0.0 


O.l 


2.7 | 


IBD Crohn's 


1.6 


0.0 


0.2 


Monocytes LPS 


0.0 


0.0 


0.2 


Colon 


8.6 


0.1 


3.1 


Macrophages rest 


0.0 


0.0 


0.0 j 


Lung 


2.0 ; 


0.8 


12.3 


Macrophages LPS 


0.0 


• 0.0 


0.0 j 


Thymus 


7.0 | 


1.7 i 


20.4 


HXJVEC none 


23.0 


0.0 


0.0 


Kidney 


17.0 ! 


1.2 


16.7 
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HUVEC starved ] 25.0 ] 0,0 ) 0.0 f j ~j " " 1 



CNS_ueurodegeneration_>1.0 Summary: Ag2910/Ag25$l Results from two 
experiments using identical probe/primer sets are in excellent agreement. No difference is 
detected in the expression of the CG56091-01 gene in the postmortem brains of Alzheimer's 
patients when compared to normal controls. However, this panel does demonstrate the 
expression of this gene in the CNS of an independent group of patients. Please see panel 1.3D 
for a discussion of the potential utility of this gene in function of the central nervous system. 

Panel 1.3D Summary: Agl 479/267 4/A g2820 The CG56Q91-01 gene encodes a 
protein that is'homologous to TEN-M4 and may be involved in brain compartmentalization. In 
multiple experiments with different probe/primer sets, highest expression of this gene is seen 
in the brainand in brain cancer cell lines. Thus, inhibitors of this gene product could have 
utility in the treatment of diseases involving neurite outgrowth or organization, such as 
neurodegenerative diseases. 

In addition to expression in brain cancer cell lines, there is substantial expression in 
other samples derived from cancer cell lines, such as breast cancer, lung cancer ovarian 
cancer. Thus, the expression of this gene could be used to distinguish these samples from other 
samples in the panel. Moreover, therapeutic modulation of this gene or its protein product, 
through the use of small molecule drugs, antibodies or protein therapeutics, might be of use in 
the treatment of brain cancer, lung cancer, breast cancer or ovarian cancer. 

This gene is also moderately expressed metabolic and endocrine tissues, including 
adrenal, thyroid, pituitary, fetal heart, adult and fetal skeletal muscle, and adipose. Thus, this 
gene product may be an antibody target for the treatment of any or all diseases in these tissues, 
including obesity and diabetes (Mieda M, Kikuchi Y, Hirate Y, Aoki M, Okamoto H. 
Compartmentalized expression of zebrafish ten-m3 and ten-rn4, homologues of the Drosophila 
ten(rn)/odd Oz gene, in the central nervous system. Mech Dev 1999 Sep;87(l~2):223-7). 

Panel 2D Summary: Ag2581/Ag291 0 Results from two experiments using an 
identical probe/primer set show reasonable concordance, with both runs showing highest 
expression of the CG56091-01 gene in ovarian cancer (CT = 28-29). In addition, the level of 
expression of this gene appears to be increased in some lung and gastric cancer tissue samples 
when compared to the matched normal tissue. The reverse appears to be true for kidney, where 
expression is slightly higher in 6 of 9 normal tissues than in the matched cancer tissues. Thus, 
based upon its profile, the expression of this gene could be of use as a marker for 
distinguishing these cancers from the normal adjacent tissue or as a marker for different 
grades/ types of cancer. Furthermore, therapeutic inhibition of the activity of the product of 
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this gene ? through the use of antibodies, peptides or polypeptides, maybe useful in the ■ 
treatment of gastric and lung cancer. 

Panel 3D Summary: A42581 The CG56091-01 gene is expressed a!t a low level by 
select cell lines used in this panel. The highest level of expression is seen in NCTH526, a lung 
5 cancer cell line (CT-27.3). Other cell lines that express this gene include neuroblastoma, 
bladder carcinoma and renal cell cancer cell lines. Therefore, therapeutic inhibition of the 
activity of the product of this genp, through the use of antibodies, peptides or polypeptides, 
may be useful in the therapy of cancers used in the derivation of these cell lines. 

Panel 4D Summary: Agj £79/Ag2674/Ag2S20 The expression of the CG56091-01 
10 gene is highest in astrocytes and microvascular dermal endothelial cells (CTs : =29-30), with 
low but significant expression in keratinocytes, and dermal fibroblasts. Expression is not' 
modulated by any treatment, suggesting that this protein may be important in normal 
homeostasis. Thus, this transcript or the protein it encodes could be used to identify the tissues 
and cells in which it is expressed. ! » 

15 B. NOV2: Semphoi in-like 



Expression of the NOV2 gene (20422974JM32_dal) was assessed using the primer- 
probe set Ag37, described in Table 28. Results of the RTQ-PCR runs are shown in Table 29. 

Table 28 . Probe Name Ag37 



Primers 


Sequences 


Lengtrv 


Start 
Position 


Forward 


5 1 -ggcttcctcatggtactcctta-3 1 {SEQ ID NO: 199) j 


22 


983 


Probe 


TET-5 1 -ccgctggatctcttccaactggtact-3 » -TAMRA (SEQ 
ID NO:200) 


26 ! 


940 


Reverse 


5 » -acagtggggtgacatgtacct-3 1 (SEQ ID NO; 201) 


21 


905 



Table 29 . Panel 1 



J 

j Tissue Name 


Rel. Exp.(%) Ag37, Run 
87355114 


Tissue Name 


Rel. Exp.(%) Ag37, Run 
87355114 


Endothelial cells 


1.1 


Renal ca. 786-0 


4.6 


Endothelial cells (treated) 


2.2 


Renal ca. A498 


2.7 


Pancreas 


4.9 


Renal ca. RXF 393 


3.2 ! 


Pancreatic ca. CAP AN 2 


2.0 


Renal ca. ACHN 


2.4 


Adrenal gland 


9.2 


Renal ca. UO-31 


1.7 


Thyroid 


6.3 


Renal ca.TK-10 


4.3 


Salivary gland 


4.6 


Liver 


8.5 


Pituitary gland : 


9.1 


Liver (fetal) 


4.5 


Brain (fetal) 


12.1 \ 


Liver ca. (hepatoblast) 
HepG2 


7.1 


Brain (whole) j 


24.0 


Lung 


4,5 


Brain (amygdala) 


6.4 


Lung (fetal) 


15.1 


Brain (cerebellum) ! 


37.9 


Lung ca. (small cell) LX-1 


3.7 
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Brain (hippocampus) 


28.7 


Lung ca. (small cell) NCI- 
H69 , 


2.1 


Brain (substantia nigra) ! 


15.1 


Lung ca. (s.cell var.) SHP- 

77 , , 


0.0 


Brain (thalamus) 


14.5 


Lung ca. (large cell)NCI- 
rl4oO 


0.0 


Brain (hypothalamus) 


4.0 


Lung ca. (non-sm. cell) 

a c^n 
Aj49 


3.0 


Spinal cord 


32.3 


Lung ca. (non-s.cellj NCI- 
H23 


12.1 


glio/astroU87-MG 


1.9 


Lung ca. (non-s.c'ell) 
HOP-62 


4.7 


glio/astroU- U8-MG j 


, 2.4 


Lung ca. (non-s.cl) NCI- 
H522 


14.1 


astrocytoma SW1783 


0.7 


Lung ca. (squaxn.) SW 900 


13.9 1 


neuro*; met SK-N-AS 


ii.o ; 


Lung ca. (squam.) NCI- 
H596 


2.0 


astrocytoma SF-539 ! 


9.3 


Mammary gland 


16.4 


astrocytoma SNB-75 \ 


10.5 


Breast ca.* (pl.ef) MCF-7 


100.0 


glioma iSNJo-iy > 




Breast ca.* (pl.ef) MDA- 
MB-231 


1.4 


glioma U251 


1.6 


Breast ca.* (pi. ef) T47D 


2.6 


glioma SF-295 


3.7 


Breast ca. BT-^49 


0.0 


Heart 


8.3 


Breast ca. MDA-N 


7.3 


Skeletal muscle 


2.9 


Ovary 


18.8 


Bone marrow 


1.9 


Ovarian ca. OVCAR-3 


6.1 


Thymus 


5.8 


Ovarian ca. OVCAR-4 j 


1.5 


Spleen 


18.8 


Ovarian ca. OVCAR-5 [ 


5.5 


Lymph node 


1 % 0 

1 _5.V 


Ovarian ca OVCAR-S 


6.9 


Colon (ascending) 


0 0 


Ovarian ra TGROV-1 

wv ai lull V/ ti . i vjivw v x 


2.6 


Stomach 


7.2 ; 


Ovarian r*a fa^citfis^ SK- 

KJ veil Kill \^<l, I tlOvJIlviJI 

OV-3 


2.5 


Small intestine 


6.5 


Uterus ; 


6.2 


colon ca. ibW4ou ; 




T^lappnta 
jr idvcjuia 


45.4 


Colon ca.* SW620 (SW480 
met; 


2.9 


Prostate 


12.3 


Colon ca. HT29 


0.9 


Prostate ca * fbone met^ 
PC-3 


0.0 


Colon ca. rici - \ i o 


0.0 


Testis 


8.9 


colon ca. cat^o-z 


5.9 


Melanoma Hs688(A).T 


1.0 


Colon ca. HCT-15 




Melanoma* (met) 
Hs688(B).T 


1.2 


Colon ca. HCC-2998 


1.5 


Melanoma UACC-62 


1.6 


Gastric ca. * (liver met) 
NCI-N87 


6.3 


Melanoma M14 


9.7 


Bladder 


5.4 


Melanoma LOX IMVI 


9.8 


Trachea 


8.5 


Melanoma* (met) SK- 
MEL-5 


5.8 


Kidney 


4.5 


Melanoma SK-MEL-28 | 9.2 


Kidney (fetal) 


11.8 
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CNS^neurodegenerationvl.O Summary: Ag37 Expression of this gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

Genera!_screeningjpaneljvl.4 Summary: Ag37 Results from one experiment with 
the 20422974JM32_dal gene are not included. The amp plot indicates that there were 
experimental difficulties with this run. 

Panel 1 Summary: Ag37 The 20422974_0_132_dal gene is expressed at high to 
moderate levels in the majority of the samples on this panel. Expression of this gene is highest 
in a breast cancer cell line (CT = 22). In addition, there is substantial expression in' placental 
tissue and in a number of brain regions. Thus, the expression of this gene, could be used to 
distinguish the sample derived from MCF-7 cells from other samples in the panel. Moreover, 
therapeutic modulation of this gene or its protein product, through the use of small molecule 
' drugs, antibodies or protein therapeutics, might be of benefit in the treatment of breast cancer. 
In general, however, expression appears to be higher in normal cells than in the cancer cell 
lines. 

Among tissues with endocrine or metabolic activity, this gene is expressed at high 
levels in pancreas, adrenal gland, thyroid, pituitary gland, hypothalamus, heart, skeletal 
muscle, and liver. This observation indicates an importance for this gene in endocrine and 
metaboic physiology. Therefore, therapeutic modulation of this gene and/or gene product may 
prove useful in the treatment of diseases associated with these physiological systems, 
including diabetes and obesity. 

Expression of the gene is very high throughout the central nervous system including in 
the spinal cord, amygdala, cerebellum, hippocampus, thalamus, and substantia nigra. This 
gene encodes a protein with homology to sempaphorins. Semaphorins can act as axon 
guidance proteins, specifically as chemorepellents that inhibit CNS regenerative capacity. 
Therefore, therapeutic modulation of the activity and/or amount of this protein may be of use 
in inducing a compensatory synaptogenic response to neuronal death in Alzheimer's disease, 
Parkinson's disease, Huntington's disease, spinocerebellar ataxia, progressive supranuclear 
palsy, multiple sclerosis, ALS, head trauma, stroke, or any other disease/condition associated 
with neuronal loss. 

Panel 4.1D Summary: Ag37 Expression of this gene is low/undetectable (CTs 
> 35) across all of the samples on this panel (data not shown). 

C. NOV3: ERYTHROID MEMBRANE-ASSOCIATED PROTEIN 

Expression of the NOV3 gene (CG50351-01) was assessed using the primer-probe set 
Ag2623, described in Table 30. Results of the RTQ-PCR runs are shown in Tables 31-34. 
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Table 30 . Probe Name Ag2623 



Primers 


Sequences 


Length 

■ ft 1 — 


Start 
Position 


Forward 


5 » -tggactatgaagcaggagtcat-3 ' ( (SEQ ID NO: 202) 


22 


1397 


Probe 


TET-5 ' -caatgtgaccaacaagtcccacatct-3 1 -TAMRA (SEQ 
ID NO:203) 


26 


1428 


Reverse 


5 ' -cagagaaattgtgggtgaaagt-3 » (SEQ ID ttO;204) 


22 


1456 



Table 31 . CNS_neurodegeneration_vl .0 



Tissue Name 


Kel. Exp.(%) Ag2623, Kun 
208393898 


Tissue Name 


Kei. iLxp.( /o) Ag^ozj, j\un 
208393898 


AD 1 Hippo , 


32.5 


Control (Patn) J 
Temporal Ctx 


13.0 


AD 2 Hippo 


48.6 


Control (Path) 4 
Temporal Ctx 


40.3 


AD 3 Hippo 


12.2 


AD 1 Occipital Ctx 


22.8 


AD 4 Hippo 


14.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


87.1 


AD 3 Occipital Ctx 


6.9 


AD 6 Hippo ; 


, 65.1 


AD 4 Occipital Ctx 


25.3 


Control 2 Hippo 


40.6 


AD 5 Occipital Ctx 


1 33.9 


Control 4 Hippo 


43.2 


AD 6 Occipital Ctx • 


12.5 


Control (Path) 3 Hippo i 


15.8 


Control 1 Occipital Ctx 


5.1 


AD 1 Temporal Ctx 


33.4 


Control 2 Occipital Ctx 


46.7 


AD 2 Temporal Ctx 


47.6 


Control 3 Occipital Ctx 


19.9 


AD 3 Temporal Ctx 


.9.0 


Control 4 Occipital Ctx 


14.5 


AD 4 Temporal Ctx ■ 


35.4 | 


Control (Path) 1 
Occipital Ctx 


72.2 


AD 5 Inf Temporal Ctx 


97.9 \ 


Control (Path) 2 
Occipital Ctx 


30.8 


AD 5 Sup Temporal 
Ctx 


85.3 


Control (Path) 3 
Occipital Ctx' , 


8.1 


AD 6 Inf Temporal Ctx . 


38.7 j 


Control (Path) 4 
Occipital Ctx 


20.0 


AD 6 Sup Temporal 
Ctx 


50.0 


Control 1 Parietal Ctx 


11.9 


Control 1 Temporal Ctx 


11.4 


Control 2 Parietal Ctx 


100.0 


Control 2 Temporal Ctx : 


38.2 


Control 3 Parietal Ctx 


22.2 


Control 3 Temporal Ctx ; 


18.3 


Control (Path) 1 
Parietal Ctx 


45.4 


Control 3 Temporal Ctx 


24.7 


Control (Path) 2 
Parietal Ctx 


27.4 


Control (Path) 1 
Temporal Ctx 


58.2 


Control (Path) 3 
Parietal Ctx 


6.3 


Control (Path) 2 i 
Temporal Ctx 


42.0 


Control (Path) 4 
Parietal Ctx 


25.2 



Table 32 . Panel 1.3D 



Tissue Name 1 ReL Exp * (% > Ag2623 ' Run : 
I issue rsame j 167644904 


Tissue Name 


Rel. Exp.(%) Ag2623> Run 
167644904 


Liver adenocarcinoma j 12.9 


Kidney (fetal) 


51.1 
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Pancreas 


3.9 


|Renal ca. 786-0 


11.0 ■ 


Pancreatic ca. CAP AN 2 




jRenal ca. A498 


4.1 


Adrenal gland 


3.1 


|Renal ca. RXF 393 


7.9 


Thyroid 


' 6.3 


jRenal ca'. ACHN 


. 4.4 


OallVaiy glaliu 


*t»J 


iRenal ca UO-3 1 


3.9 


Pituitary gland 


9.9 


Renal ca. TK-10 


7.8 


Brain (fetal) 


3.8 


It • 

[Liver 


8.2 


Brain (whole) 


7.5 


Liver (fetal), 


100.0 


Brain (amygdala) 




Liver ca. (hepatoblast) 
HepG2 


3.6 


Brain (cerebellum) 


3.6 


Lung 


4.1 


Brain (hippocarnpus) 


3.9 


Lung (fetal) 


8.4 


Brain (substantia nigra) • 


7.3 


Lung ca. (small cell) LX- 

ii 


4.1 


Brain (thalamus) 


4.8 


Lung ca. (small cell) 
NCI-H69 


0.4 


Cerebral Cortex 




Lung ca. (s.cell var.) i 
SHP-77 


3.4 


Spinal cord 


'9'.4 


Lung ca. (large cell)NCI- 
H460 


2.2 


glio/astro U87-MG ' 


9.6 


Lung ca. (non-sm. cell) 
A549 


7.4 


glio/astro U- 1 1 8-MG 


6.9 


Lung ca. (non-s.cell) 
NCI-H23 


5.D 


astrocytoma SW 1783 ; 


0.7 


Lung ca. (non-s.cell) 
HOP-62 


o. / 


neuro ; met bK-N-Ab 


l.o • 


Lung ca. (non-s.cl) NCI- 
H522 


j.j 


astrocytoma SF-539 

.i 


5.7 


Lung ca. (squam.) SW 
900 


3.9 


astrocytoma SNB-75 


12.6 


Lung ca. (squam.) NC1- 


1.7 


glioma oiNiD-iy 


j.j 


Mammary gland 


1 9 R 
1 z.o 


glioma U251 


12.7 


joredSL \^a, ^pi.ez^ iviv^r- 
7 


7.5 


glioma SF-295 


7.3 


Breast ca.* (pl.ef) MDA- 1 
MB-231 


2.9 


Heart (fetal) 




Breast ca.* (pl.ef) T47D 


15.4 


Heart 


2.5 


Breast ca. BT-549 


4.5 


Skeletal muscle (fetal) 


3.5 ; 


Breast ca. MDA-N 


1.6 


Skeletal muscle 


2.6 


Ovary 


4.9 


Bone marrow , 


31.9 


Ovarian ca. OVCAR-3 


6.6 


Thymus 


5.7 


Ovarian ca. OVCAR-4 


4.2 


Spleen 


3.0 


Ovarian ca. OVCAR-5 


18.4 


Lymph node 


2.7 


Ovarian ca. OVCAR-8 


2.0 


Colorectal 


8.9 


Ovarian ca.IGROV-1 


1.9 


Stomach 


3.8 


Ovarian ca.* (ascites) 
SK-OV-3 


28.9 


Small intestine 


1.6 


Uterus 


5.1 


Colon ca. SW480 


5.5 


Placenta 


0.9 


Colon ca.* SW620(SW480 


17.3 j 


Prostate 


2.9 
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met) 








Colon ca. HT29 


3.8 


Prostate ca.* (bone 
met)PC-3 


3.7 


Colon ca. HCT-116 


1.5 ' 


Testis' ' '. 


1.0 


Colon ca.' CaCo-2 


4.4 


Melanoma Hs688(A).T 


4.8 


Colon ca. 
tissue(OD03866) 


3.4 


Melanoma* (met) 
Hs688(B).T 


3.0 


Colon ca. HCC-2998 


6.3 


Melanoma UACC-62 


3.8 


Gastric ca.* (liver met) 
NCI-N87 


8.2 


Melanoma M34 


1.3 


Bladder 


6.7 


Melanoma LOX IMVI : 


0.8 


Trachea 




Melanoma* (met) SK- 
MEL-5 


0.3 


Kidney 


7.7 


Adipose 


7.7 



Table 33 . Panel 2.2 



Tissue Name 


ReL Exp.(%) Ag2623, 


Tissue Name 


Rel. Exp.(%) Ag2623, 


(Normal Colon 


33.7 


Kidney Margin (OD04348) 


70.2 


jcolon cancer (OD06064) 


48.3 


Kidney malignant cancer 
(OD06204B) 


18.3 


Colon Margin (OD06064) 


57.0 


Kidney normal adjacent 
tissue ^OD06204E^ 


27.2 


Colon cancer (OD06159) 


4.6 


Kidney Cancer (OD04450- 
01> : 


46.3 


Colon Margin (OD06159) 


19.3 


Kidney Margin (OD04450- 
03) 


18.7 


Colon cancer (OD06297-04) 


2.1 


Kidney Cancer 8120613 


2.9 


Colon Margin (OD06297- 
015) 


30.8 


Kidney Margin 8120614 


17.9 


CC Gr.2 ascend colon 
(OD03921) 


2.9 


Kidney Cancer 9010320 


7.1 


CC Margin (OD03921) 


2.8 


Kidney Margin 9010321 


5.3 


Colon cancer metastasis 
(OD06104) 


8.0 


Kidney Cancer 8120607 


14.0 


Lung Margin (OD06104) 


16.4 


Kidney Margin 8120608 


3.8 


Colon mets to lung 
(OD04451-01) 


7.5 


Normal Uterus 


68.3 


Lung Margin (OD04451-02) 


25.7 


Uterine Cancer 06401 1 


15.8 


Normal Prostate 


16.0 


Normal Thyroid 


17.8 


Prostate Cancer (OD04410) 


9.7 


Thyroid Cancer 064010 


29.1 


Prostate Margin (OD04410) 


9.3 


Thyroid Cancer A302152 


34.6 


Normal Ovary 


11.2 


Thyroid Margin A302153 


6.2 


Ovarian cancer (OD06283- 
03) 


18.6 


Normal Breast 


49.0 


Ovarian Margin (OD06283- 
07) 


8.6 


Breast Cancer (OD04566) 


7.5 


Ovarian Cancer 064008 


10.7 


Breast Cancer 1024 


36.6 


Ovarian cancer (OD06145) 


4.8 


Breast Cancer (OD04590- 
01) 


20.4 


Ovarian Margin (OD06145) 


18.2 


Breast Cancer Mets 
(OD04590-03) 


34.6 
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Ovarian cancer (OD0645 5- 
03) 


14.3 


Breast Cancer Metastasis 


40.9 ' 


Ovarian Margin (OD06455- 
07) 


18.7 

i 


Breast Cancer 064006 




Normal Lung , 


5.8 


jbreasi uancer yruuzoo 


17 7 


Invasive poor diff. lung 


10.7 


Breast Margin 9100265 


33.9 




16.7 


Breast Cancer A209073 


5.4 


X-<UJig JViallgllalH ^allw-CI 

(OD03126) 


5.3 


Breast.Margin A2090734 


21.0 


Lung Margin (OD03126) 


'. 4.7 


Breast cancer (OD06083) 


62.4 


Lung Cancer (OD0501 4 A) 
i 


7.7 


Breast cancer node 
metastasis (OD06083) 


34.2 


Lung Margin (OD05014B) 1 


49.0 


Normal Liver 


32.1 


Lung cancer (OD06081) 


34.2 


Liver Cancer 1026 


6.0 


Lung Margin (OD06081) 


17.7 


Liver Cancer 1025 


39.0 • 


T iiT-.fi Potior (C\T\C\A r )'\l C\\\ 

LyUng cancer [KJiJvHZj /-uj ) 


6.4 


Liver Cancer 6004 -T 


16.4 


Lung JViargin {uuv^Zd /~uz ) 




. 1 — , j 

1 iver Tissue 6004-N 


6.4 


Ocular Melanoma Metastasis 


5.7 


Liver Cancer 6005-T 


27.4 


Ocular Melanoma Margin 
(Liver) 1 


24.7 


Liver Tissue 6005-N 


45.1 


Melanoma Metastasis 


4.8 ; 


Liver Cancer 064003 


10.3 


Melanoma Margin (Lung) 


20.9 


Normal Bladder 


1 O 1 


Normal Kidney 


11.7 


Bladder Cancer 3 023 


7.3 


Kidney Ca, Nuclear grade 2 
(OD04338) 


48.0 


Bladder Cancer A302173 


8.1 


Kidney Margin (OD04338) 


25.9 


Normal Stomach 


31.6 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


100.0 


Gastric Cancer 9060397 


2.3 


Kidney Margin (OD04339) 


20.3 


Stomach Margin 9060396 


10.3 


Kidney Ca, Clear cell type 
(OD04340) 


8.8 


Gastric Cancer 9060395 


13.4 


Kidney Margin (OD04340) 


16.6 


Stomach Margin 9060394 


27.5 


Kidney Ca, Nuclear grade 3 
(OD04348) 


8.2 


Gastric Cancer 064005 


6.9 


Table 34. Panel 4D 


Tissue Name 


Rel. Exp.(%) Ag2623, 
Run 164299479 


Tissue Name 


Rel. Exp.(%) Ag2623, 
Run 164299479 


Secondary Thl act 


2.5 


HUVEC IL-lbeta 


1.2 


Secondary Th2 act 


4.2 


HUVEC IFN gamma 


5.2 


Secondary Trl act 


4.1 


HUVEC TNF alpha + IFN 
gamma 


4.1 


Secondary Thl rest 


2.4 


HUVEC TNF alpha + IL4 


4.4 


Secondary Th2 rest 


2.4 


HUVEC IL-11 


3.8 


Secondary Trl rest 


3.8 


Lung Microvascular EC none 


11.7 


Primary Thl act 


3.2 


Lung Microvascular EC 
TNFaVpha + IL-lbeta 


8.7 


Primary Th2 act 


3.8 


Microvascular Dermal EC none 


14.3 


Primary Trl act 


3.3 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


; 7.0 
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Primary Thl rest 


1L3 


Bronchial epithelium TNFalpha 
>ILlbeta' 


10.7 


Primary Th2 rest 


7.7 


| Small airway epithelium none . 


; 3.2 


Primary Trl rest 


6.2 


Small airway epithelium ■ 
TNFalpha 4- IL- 1 beta 


14.5 ' 


CD45RA CD4 lymphocyte 
act 


4.5 


Coronery artery SMC rest 


6.0 


CD45RO CD4 lymphocyte 
act 


3. 1 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


4.0 


CDS lymphocyte act 


2.5 


Astrocytes rest • 


8.2 


Secondary CD 8 
lymphocyte rest 


2.3 


Astrocytes TNFalpha + IL-lbeta 


.7.6 


Secondary CD8 
lymphocyte act 


2.5 


KU-812 (Basophil) rest 


77.9 

i 


CD4 lymphocyte none 


1.9 


KU-812 (Basophil) 

T^\yf A /if\Tif\m , \/f > in 
x l\Lr\J lUllUIliyOJil 


100.0 


2ryThl/Th2/Trl_anri- 
i rnQs phi i 


4.7 


CCD1 106 (Keratinocytes) none 


3.1 


LAK cells rest 


5.8 


l^jji loo Rivera unocyiesj 
TNFalpha + IL-lbeta 


1.3 


LAK cells IL-2 


5.8 


Liver cirrhosis 


4.0 


LAK cells IL-2+IL-12 


4.0 


Lupus kidney 


1.8 


LAK cells 1L-24-IFN 
gamma 


6.5 


NCI-H292 none 


23.3 


LAK cells IL-2+ IL-18 


3.5 


NC1-H292 IL-4 


23.3 


LAK cells 
PMA/ionomycin 


1.2 


NCI-H292 IL-9 


24.3 


NK Cells IL-2 rest 


4.2 


NCI-H292IL-13 \ 


13.2 


Two Way MLR 3 day 


9.0 


NCI-H292 1FN gamma 


15.3 


Two Way MLR 5 day 


3.0 


HPAEC none 


4.8 


Two Way MLR 7 day 


1.5 


HPAEC TNF alpha + IL-1 beta 


6.2 


PBMC rest 


3.6 


Lung fibroblast none 


12.3 


PBMC PWM 


10.3 


1 beta 


9.2 


PBMC PHA4 


J .u 


T nno fiTrr/Vm c» ct TT -A 


15.5 


Tannine (VK ppIA nrmf* 


1 4 




12.2 


TCamnc (VK ppII^ ionornvrin 




T nno fi HrrVhla ct TT — 1 ^ 

l^iUAlg liUiUUldol JUL/"* U 


1 1.0 


R li/moTi/ip vfv^c PWTV/f 
u i yuiLJiiuks y ivo j rt ivi 


1 i .0 


T lino Tl r*TV-\l"il«3 ct TTTNJ ci Q m m n 
L.jlillg ilUliJUicLhl iri\ galilltlcL 


13.5 


"R lvmnhorvtp<s CD4DT 

and IL-4 


10.9 


Dermal fibroblast CCD 1070 rest 


15.7 


EOL-J dbcAMP 


8.1 


Dermal fibroblast CCD 1070 
TNF alpha 


16.3 


EOL-l dbcAMP 
PMA/ionomycin 


6.4 ; 


Dermal fibroblast CCD 1070 IL- 
1 beta ; 


8.9 


Dendritic cells none 


4.5 


Dermal fibroblast 1FN gamma ' 


6.0 


Dendritic cells LPS 


5.6 


Dermal fibroblast IL-4 


16.8 


Dendritic cells anti-CD40 


8.5 


IBD Colitis 2 


1.0 


Monocytes rest 


12.0 j 


[BD Crohn's 


1.4 


Monocytes LPS 


7.8 jColon J 


6.0 


Macrophages rest 


6.5 | 


Lung \ 


8.6 


Macrophages LPS j 


4.5 jThymus 


12.9 
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HUVEC none 


5.9 


(Kidney 


1 


5.8 


HUVEC starved 


9.0 


\ ' 


1 





CNS_neurodegeneration_vJ.O Summary: Ag2623 No differential expression of this 
gene is found in Alzheimer's disease brains when compared to control postmortem brains. 
However, this panel confirms the expression of this gene at moderate level in the CNS in an 
independent group of patients. Please see Panel L3D for a discussion of utility of this gene 



5 product in central nervous system function. 

Panel 1.3D Summary: Ar2623 Expression of the CG50351-01 gene is highest in a 
sample derived from fetal liver (CT = 27.4). In addition, there is substantial expression of this 
gene in bone marrow and fetal kidney. Thus, the expression of this gene could be used to , 
distinguish these tissues, and particularly fetal liver, from the other tissues on this panel. 

10 Moreover, expression of the CG50351-O1 gene in fetal liver and bone marrow suggests that 

this gene is likely to be involved in hematopoesis. Thus, therapeutic modulation of this gene or 
its protein product, through the use of small molecule drugs, antibodies or protein therapeutics, 
might be of benefit in the treatment of hematopoetic diseases. 

This gene is also expressed at moderate to low levels in all central nervous system 

15 tissues examined, including amygdala, cerebellum, hippocampus, substantia nigra, thalamus, 
and cerebral cortex. The CG50351-01 gene encodes a protein with homology to erythroid- 
membrane associated protein (ERMAP). ERMAP potentially acts as a novel adhesion/receptor 
molecule and this class of protein has been shown to be critical in axon guidance. Therefore, 
this protein may be useful in enhancing/ guiding compensatory synapto genesis in response to 

20 CNS injury (trauma, stroke) or neuro degeneration (Alzheimer's, Parkinson's, or Huntington's 
disease). 

The CG50351-01 gene is expressed in a variety of endocrine and metabolic tissues 
including adipose, liver, pancreas, skeletal muscle and thyroid. Interestingly, expression of this 
gene is much higher in fetal liver (CT = 27.4) than adult liver (CT = 31). Therefore, expression 

25 of this gene maybe useful to distinguish fetal from adult liver and the protein product maybe 
useful in the treatment of liver degenerative disorders. In addition, this gene and its protein 
product may be an essential component in normal endocrine and metabolic physiology. 
Therefore, the therapeutic modulation of this gene or its protein product may be useful in the 
treatment of endocrine and metabolic diseases, including diabetes and obesity. 

30 Panel 2.2 Summary: Ag2623 This gene is expressed at low to moderate levels in the 

majority of tissues on this panel. Expression of the CG50351-01 gene is highest in a sample 
derived from kidney cancer (CT = 30.6). Thus, the expression of this gene could be used to 
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distinguish this sample from the other samples on this panel. Moreover, therapeutic 
modulation of this gene or its protein product, 'through the use of srnall molecule drugs, 
antibodies or protein therapeutics, might be of benefit in the treatment of kidney cancer. 

Panel 4D Summary: Ag2623 The CX350351-01 gene is expressed at low levels across 
5 the samples of panel 4D. However the highest expression of this transcript is found in KU- 
812, a basophil cell line (CT = 27). Basophils play an important role in allergic diseases and 
other diseases such as asthma and IBD. Mpderate expression of this transcript is also found in 
the pulmonary muco-epidermoid cell line NCI-H292 (CT= 29.1), a model cell line for chronic 
obstructive pulmonary disease (COPD) and emphysema. This transcript encodes for an 
10 ERMAP like molecule, a possible novel adhesion/receptor molecule (reference). Therefore 
therapeutic modulation of this gene by antibodies, small molecules or protein therapeutics 
could be useful for the treatement of allergic diseases and also asthma, emphysema, COPD, 

and IBD (Ye TZ, Gordon CT, Lai YH, Fujiwara Y, Peters LL, Perkins AC, Chui DH. Ermap, 

i ■ 

a gene coding for a novel erythf oid specific adhesion/receptor membrane protein. Gene 2000 
15 Jan25;242(l-2):337-45): 

1). NOV4a: VITELLINE MEMBRANE OUTER LAYER PROTEIN I 

Expression of NOV4a gene (CG56375-01) was assessed using the primer-probe sets 

Ag2450 and Ag2536, described in Tables 35-36. Results of the RTQ-PCR runs are shown in 

Tables 37-38. 
20 Table 35 . Probe Name Ag2450 



Primers 


Sequences ! 


Length 


Start 
Position 


Forward, 


5 ' -aagcgagaaagccactaggt-3 ' (SEQ ID NO; 205) 


. 20 . 


312 


Probe 


TET-5 ' -acaccacagcggctcactccatt -3 1 -TAMRA (SEQ ID 
NO;206) 




345 


Reverse 


5 ? -gcaatacgcacgtggtagag-3 1 (SEQ ID NO:207) 


20 


3 90 



Table 36 . Probe Name Ag2536 



Primers 


Sequences 


Length; 


Start 
Position 


Forward' 


5 ' -aagcgagaaagccactaggta-3 ' (SEQ ID NO: 208) 


21 1 


312 


Probe 


TET-5 ' -acaccacagcggctcactccatt-3 1 -TAMRA (SEQ ID 
N0:209) 


23 


345 


Reverse: 


5 1 -gtagagtcccagtctggaagct-3 ' (SEQ ID NO:210) 


22 


374 



Table 37. Panel 1. 3D 



Tissue Name 


Rel. Exp.(%) 
Ag2450, Run 
160661622 


Rel. Exp.(%) 
Ag2536, Run 
365532773 


Tissue Name 


Rel. Exp.(%) 
Ag2450, Run 
360661622 


Rel. Exp.(%) 
Ag2536, Run 
365532773 


Liver 

adenocarcinoma 


0.0 


0.0 


Kidney (fetal) 


3.2 


0.0 


Pancreas 


0.3 


0.5 


Renal ca. 786-0 


0.0 


0.0 
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Pancreatic ca. ' 
CAP AN 2 


0.0 


0.0 


Renal ca. A498 


0.0 


1.5 


Adrenal gland 


• 2 '* i 


0.0 


Renal ca. RXF 

393 '■ 


0.3 , 


0.0 


Thyroid , 


0.7 


0.0 " 


Renal ca. ACHN 


0.0 


0.0 


Salivary gland 


0.9 


0.0 (Renal ca.UO-31 


0.0 


0.0 


Pituitary gland 


/U.J 


20.0 (Renal ca.TK- 10 


u.u 


0.0 


Brain (fetal) 


0.3 


0.0 (Liver 


4.0 


2.9 


Brain (whole) 


0.2 


1.8 jLiver (fetal) 


5.7 


22.8 


Brain (amygdala) 


I 

0.6 


1 jLiver c'a. 
4.1 (hepatoblast) 
|HepG2 


0.1 


* i 

0.0 




Brain (cerebellum) 


0.0 


0.0 (Lung 


25.9 


23.8 


Brain (hippocampus) 


' '3-0 


0.0 (Lung (fetal) 


1.4 


0.0 


Brain (substantia 
nigra) 


0.5 


(Lung ca. (small 
; jcell) LX-l 


0.0 


0.0 


Brain (thalamus) 


1.1 


1.9 . ; 


Lung ca. (small i 
cell) NCI-H69 


i 0.0 


0.0 


Cerebral Cortex 


0.5 


n jLung ca. (s.cell j 
' ' jvar.)SHP-77 


0.0 


1.8 


Spinal cord , 


2.1 \ 


jLung ca. (large 
|cell)NCI-H460 


0.3 


0.0 


glio/astro U87-MG 


0.3 


• jLung ca. (non- 
jsm. cell) A549 


0.0 


0.0 


glio/astro U-118- 
MG 


0,5 


jLung ca. (non- 
^ (sxell) NCI-H23 


o.u 


A A 

U.U 


astrocytoma 
SW1783 


0.0 


iLung ca. (non- 
_ ' |s.cell) HOP-62 


0.0 ' 


0.0 


neuro*; met SK-N- ; 
AS 


0.3 | 


jLung ca. (non- 
U ' U js.cl) NCI-H522 


0.0 


0.0 


astrocytoma br-jjy 


a a 
u.u 


JLung ca. 
U - U ](squam.)SW900j 


u.u 


u.u 


astrocytoma SNB-75 


0.0 


Lung ca. 
•0.0 (squanx)NCI- 
Jh596 


0.0 


0.0 


glioma SNB-19 


0.0 


0.0 jMammary gland j 


1.7 


, 0.3 


glioma Uzz>] 


ft ft 


[Breast ca.* 
. j(pl.ef) MCF-7 


U.U 


0.0 


glioma SF-295 


0.3 


(Breast ca.* j 
0.0 (pl.ef) MDA- 
JMB-231 


0.2 


0.0 


Heart (fetal) 


3.3 


- . (Breast ca.* 
* A |(pl.ef)T47D 


0.0 


0.0 


Heart 


0.0 


(Breast ca. BT- 
U ' U (549 


0.0 


0.0 


Skeletal muscle 
(fetal) 


11.9 


(Breast ca. MDA- 
2J j N 


0.0 


0.0 


Skeletal muscle j 


0.7 


4.0 (Ovary 


2.1 


4.5 


Bone marrow 


1.3 


(Ovarian ca. 
K/ joVCAR-3 


0.0 


0.0 


Thymus 


1.5 


n n (Ovarian ca. 
U,U ]OVCAR~4 


0.0 


0.0 


Spleen 


100.0 


100.0 


Ovarian ca. 


0.0 


0.0 
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1 




OVCAR-5 


1 


Lymph node 


1.4 


9.5 


Ovarian ca. 
OVCAR-8 


0.0 


0.0 


Colorectal 


0.9 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


1.7 


1.9 


Ovarian ca.* 
(ascites) SK-OV- 
3 


0.0 


0.0 


Small intestine 


1.8 


4.2, 


Uterus 


0.3 


0.0 




0 ft 


0.0 


Placenta 


0.1 


0.0 


Colon ca.* 
owozU(oW4oU metj 


0.0 


0.0 


, ••■ • ■ - -i - 

Prostate 


1.1 


0.0 


Colon ca. HT29 


0.0 1 


0.0 


X JU.JlCttt/ VstX. 

(bone met)PC-3 


1.0 


0.0 


bolon ca.HCT-116 


0.0 


0 0 


Testis 


0.7 


2.0 


Colon ca. CaCo-2 


0.0 


o.o i 


Melanoma 

lxa\JOOyr\.J. x | 


0.3 


0.0 


Colon ca. 
tissue(OD03866) 


j ,j 


i yj.o 


Melanoma* ! 
Hs688(B).T 


0 0 


n n 

v.v 


Colon ca. HCC-2998 


0.1 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


0.3 


0.0 


Melanoma M 14 


0.0 


0.0 


Bladder 


0.1 


2.3 


Melanoma LOX 
IMV1 


0.0 


0.0 


Trachea 


65.5 


53.6 


Melanoma* 
(met) SK-MEL-5 


0.0 


0.0 


Kidney 


0.9 


1.7 


Adipose 


1.5 


5.1 



Table 38 . Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2450, Run 
160836753 


Rel. Exp.(%) 
Ag2536, Run 
164320658 


Tissue Name 


ReI.Exp.(%) 
Ag2450, Run j 
160836753 


Rel. Exp.(%) 
Ag2536, Run 
164320658 


Secondary Thl act 


0.0 


0.0 


HUVEC IL-3beta 


0.0 


0.0 


Secondary Th2 act 


0.0 


0.0 


HUVEC TFN gamma 


0.0 


0.0 


Secondary Trl act 


0.0 


0.0 


HUVEC TNF alpha + 
EFN gamma 


0.0 


0.0 


Secondary Thl rest - 


0.0 


0.0 


HUVEC TNF alpha 4- 
IL4 


0.0, 


0.0 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-11 


0.0 


0.0 


Secondary Trl rest 


0.0 


0.0 


Lung Microvascular 
EC none 


0.0 


0.0 


Primary Thl act 


0.0 


0.0 


Lung Microvascular 
EC TNEalpha + IL- 
lbeta 


0.0 


0.0 


Primary Th2 act 


0.0 


0.0 


Microvascular 
Dermal EC none 


0.0 


4.0 


Primary Trl act 


0.0 


0.0 


Microsvasular Dermal 
EC TNFalpha + IL- 
lbeta ] 


0.0 


0.0 


Primary Thl rest 


0.0 


0.0 


Bronchial epithelium 
TNFalpha + ILlbcta j 


0.0 


5.5 
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Primary Th2 rest 


4.2 


0.0 


Small airway 
epithelium none 


0.0 


4.4 


Primary Trl rest 


0.0 


0.0' 


Small airway 
epithelium TNFalpha 
+ IL~lbeta 


33 


0.0 


CD45RA CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery SMC 
rest 


0.0 


0.0 


CD45RO CD4 
lymphocyte act 


0.0 


0.0 


Coronery artery SMC 
TNFalpha + IL-1 beta 


0.0 


0.0 


CDS lymphocyte act 


0.0 


0.0 


Astrocytes rest 


0.0 


0.0 


Secondary CD8 
lymphocyte rest 


0.0 


0.0 


Astrocytes TNFalpha 
+ IL-lbeta 


0.0 


0.0 


Secondary CD8 
lymphocyte act 


U.O 


u.u 


KU-S 12 (Basophil) 
rest 


A A 

u.u 


U.O 


Cli4 lymphocyte 
none 


0.0 


4.3 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


"f" 

0.0 


2ry 

i;hl/rii2/Trl anti- 
CD95 CH11 


0.0 


0.0 


CCD 1 1 06 

(Kerarinocytes) none 


0.0 


0.0 


LAK cells rest 


12.2 


4.S 


CCD 13 06 
(Kerarinocytes) 
TNFalpha + IL-1 beta 


0.0 


0.0 


LAK cells IL-2 


0.0 


0.0 


Liver cirrhosis 


9.0 


16.0 


LAK cells IL-2+IL- 
12 


3.0 


0.0 


. . - ^ r*™" - 

T lmn^ Ividnev 


0.0 


4.3 


LAK cells IL-2+IFN 
gamma 


0.0 


1.4 


NCI-H292 none i 


0.0 


0.0 


LAK cells IL-2+ IL- 
18 


0.0 


0.0 


NCLH292 IL-4 


0.0 


3.3 


LAK cells 
PMA/ionomycin 


13.1 


0.8 


NCLH292 IL-9 


4.2 


0.0 


KK Cells IL-2 rest ! 


0.0 


0.0 


NCI-H292IL-13 


0.0 


0.0 


Two Way MLR 3 
day 


100.0 


100.0 


NCI-H292 IFN 
gamma > 


0.0 


0.0 


Two Way MLR 5 1 
day 


6.3 


8.5 


HPAEC none 


0.0 


0.0 


Two Way MLR 7 
day 


3.3 


8.6 


HPAEC TNF alpha + ; 
1L-1 beta 


0.0 


0.0 


PBMC rest 


0.0 


4.4 


Lung fibroblast none ; 


4.3 


0.0 


PBMC PWM 


0.0 


0.0 


Lung fibroblast TNF 
alpha + IL- 1 beta 


0.0 


0.0 


PBMC PHA-L 


0.0 


4.7 


Lung fibroblast IL-4 


2.9 


0.0 


Ramos (B cell) none 


0.0 


0.0 


Lung fibroblast IL-9 


0.0 


0.0 


Ramos (B cell) 
ionomycin 


0.0 


0.0 


Lung fibroblast IL-13 


o.o • 


4.3 


B lymphocytes PWM; 


0.0 


0.0 


Lung fibroblast IFN : 
gamma 


0.0 


0.0 


B lymphocytes 
CD40L and IL-4 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 rest 


0.0 


0.0 


EOL-1 dbcAMP 


0.0 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha ; 


0.0 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.0 


Dermal fibroblast 
CCD1070IL^1 beta 1 


4.9 


0.0 
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Tl f^n /'I TitHr* ^f^llc Tk/"\ttf» 


0 f) 

1 * 




Dermal fibroblast IFN 
gamma 


0.0 


0 0 


Dendritic cells LPS 


, 0,0 , 


0.0 


Dermal fibroblast IL- 
4 


0.0 r 


0.0 


Dendritic cells anti- 
CD40 


0.0 


0.0 


IBD Colitis 2 . 


0.0 


0.0 


Monocytes rest 


2.0 


0.0 


IBD Crohn's 


0.0 


4.6 j 


Monocytes LPS 


12.4 


25.9 


Colon 


9.2 


0.0 


Macrophages rest 


5.8 


34.4 


Lung 


62.4 


75.8 


Macrophages LPS 


19.8 , 


15.5' 


Thymus 1 


■ 8.2 


' '4.4 , 


UUVEC none 


0.0 


0.0 


Kidney 


19.3 




HUVEC starved 


0.0 


0.0 









Panel 1. 3D Suinmary: Ag2450/Ag2536 Expression of the CG56375-01 gene was 



assessed in two independent runs on Panel 1.3D using identical primer/probe pairs. The results 
from the two experiments are in excellent agreement. In both experiments, expression of this 
gene is highest in a sample derived from spleen tissue. In addition, there is substantial 
5 expression of this gene in lung, trachea and pituitary gland. Thus, the expression of this gene 
could be used to distinguish these tissues from the other tissues in the panel. 1 

Panel 4D Summary: Ag2450/Ag2536 Expression of the CG56375-01 gene was 
assessed in two independent runs on Panel 4D using identical primer/probe pairs. The results 
from the two experiments are in excellent agreement. In both experiments, expression of this 

10 gene is highest in 2-way mixed lymphocyte reaction (MLR) samples (CT - 33), but it is also 
expressed at lower levels in macrophages. In Panel 1.3D, this transcript is detected in spleen, 
an important site of secondary immune responses, consistent with the expression of this 
transcript in activated lymphocytes (MLR) and macrophages. Therefore, expression of this 
gene can be used as a marker of activated lymphocytes. Furthermore, modulation of the 

1 5 activity of this gene or its protein product, using antibodies or protein therapeutics, could be of 
benefit for inhibiting or preventing allograft rejection in transplantation. Consistent with what 
is observed in Panel 1.3D, this gene is also expressed at low levels in the lung and could 
therefore potentially be used as a marker for lung tissues. 
E. CG56089-01: MAST205-like (Synti ophin Associated Kinase-] ike) 

20 Expression of the NOV5 gene CG56089-01 was assessed using the primer-probe set 

Ag2881, described in Table 39. Results of the RTQ-PCR runs are shown in Tables 40-44. 
Table 39 . Probe Name Ag2881 



Primers 


Sequences 


Length 


Start 
Position 


Forward; 


5 ! -tgaaatgcaacagctatcaaca-3 1 (SEQ ID NO: 211) ■ 


22 


2643 


Probe 


TET- 5 1 -tccaactcttcagatactgaaagcaaca-3 ' -TAMRA (SEQ ; 
ID NO; 2 12) " ~ \ 


28 


2665 


Reverse 


5 ' -tttgggaagtaggccagaac-3 ' (SEQ ID NO: 213) 


20 


2705 
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Table 40 . CNSjneurodegeneration_vl.O 



Tissue Name 


ReL Exp.(%) Ag2881, Run 
208700817 


i x issue ii*ime 


Rel. Exp.(%) Ag2881, Run 
208700817 


AD 1 Hippo 


11.4 


jControl (Path) 3 
{Temporal Ox 


9.2 


AD 2 Hippo 


16.7 


jControl (Path) 4 
jTemporal Ctx 


37.6 


J\U 3 HlppO 


i .1 


l uccipnai y-ix 


1 D.U 


AD 4 Hippo 


6.4 


lAD 2 Occipital Ctx 
[(Jvllssmg) 


0.0 


AD 5 hippo 


91.4 


|AD 3 Occipital Ctx 


9,3 


AD 6 Hippo 


, 38.4 


t : 

JAD 4 Occipital Ctx 


24.7 


Control 2 Hippo 


20.0 


jAD 5 Occipital Ctx 


22.1 


Control 4 Hippo 


9,2 


|AD 6 Occipital Ctx 


37.1 


Control (Path) 3 Hippo 


6.5 


(Control 1 Occipital Ctx 


8.3 


AD 1 Temporal Ctx 


18.9 


]Control 2 Occipital Ctx 


79.6 ! 


AD 2 Temporal Ctx 


25.2 


jControl 3 Occipital Ctx 


24.1 


AD 3 Temporal Ctx 


6.7 


|Controi 4 Occipital Ctx ; 


6.5 


AD 4 Temporal Ctx 


19.1 


[Control (Path) 1 
[Occipital Ctx 


55.5 


AD 5 Inf Temporal Ctx ! 


92.7 


Control (Path) 2 
Occipital Ctx ' 


15.8 


AD 5 SupTemporal Ctx I 


16.4 


Control (Path) 3 
Occipital Ctx 


4.5 


AD 6 Inf Temporal Ctx 


34.2 


Control (Path) 4 
Occipital Ctx 


21.2 


AD 6 Sup Temporal Ctx 


39.8 


Control 1 Parietal Ctx : 


11.6 


Control 1 Temporal Ctx ; 


6.7 jControl 2 Parietal Ctx 


28.9 


Control 2 Temporal Ctx 


37.6 


Control 3 Parietal Ctx 


23.8 


Control 3 Temporal Ctx 


20.9 


Control (Path) 1 
Parietal Ctx 


100.0 


Control 4 Temporal Ctx 


11.8 


Control (Path) 2 
Parietal Ctx 


363 


Control (Path) 1 
Temporal Ctx 


41.5 


Control (Path) 3 
Parietal Ctx 


6.5 


Control (Path) 2 
Temporal Ctx 


18.9 


Control (Path) 4 
Parietal Ctx 


40.1 


Table 41. Panel 1. 3D 


Tissue Name 


iRel. Exp.(%) Ag288l 5 Run 
J 167649413 


Tissue Name 


Rel. Exp.(%) Ag2881, Run 
167649413 


Liver adenocarcinoma 


\ 13.2 


Kidney (fetal) 


8.6 


Pancreas 


V 0.7 


Renal ca. 786-0 


0.9 


Pancreatic ca. CAP AN 2 


| 3.8 


Renal ca. A498 


2.0 


Adrenal gland 


J 3.1.... 


Renal ca. RXF 393 


0.0 


Thyroid 


T~ 2.1 


Renal ca. ACHN 


2.5 


Salivary gland 


I 3,1 


Renal ca. UO-31 


0.0 


Pituitary gland 


1 . 1-8 


Renal ca. TK- 10 


1.3 


Brain (fetal) 


J 9.7 


Liver 


0.8 


Brain (whole) 


| 13.6 : 


Liver (fetal) 


0.8 


Brain (amygdala) 


j 4.6 


Liver ca. (hepatoblast) 


4.8 
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I ' ! , |HepG2 T 

iBrain (cerebellum) | ' k 6\2 ' |Luiig 



[Brain (hippocampus) j , t 7.4 |Lung (feral) J t , 7.4 



Brain (substantia ,nigra) 


3.3 


Lung ca. (small cell) LX- 

i 




Brain (thalamus) 


5.7 


Lung ca. (small cell) 
NCI-H69 


13.1 


Cerebral Cortex 


zis.y 


Lung ca. (s.ce.11 var.) 
SHP-77 


zl .3 


Spinal cord 


i 

3.1 


Lung ca.' (large cell)NCI- 
H460 


' i 

1 o 1 


glio/astro U87-M.G 


6.8 


Lung ca. (non-sm. cell) 
A549 


21.3 


glio/astroU-118-MG 


14.2 


Lung ca. (non-s.cell) 
NC1-H23 


0.2 


tibii ot/yiuinii o w l / oj 


1 7 


Lung ca. (non-s.cell) 
HOP-62 


1.1 


neuro*; met SK-N-AS 


4.8 


— ••• - • ■ --. -v • y^- 

Lung ca. (non-s.cl) NCI- 
H522 


0.3 


astrocytoma SF-539 j 


• 0.6 


Lung ca. (squam.) SW 
000 1 


21.5 


. . 

astrocytoma SNB-75 


11.3 


H596 j 


23.2 


glioma SNB-19 


1.6 


Mammary gland 


5.6 


glioma U251 


11.5 j 


Breast ca.* (pl.ef) MCF- \ 

7 ! 


21 .0 


glioma SF-295 


3.1 


Breast ca.* (pl.ef) MDA- ; 
MB-231 


5.5 


Heart (fetal) 


6.4 


Breast ca.* (pl.ef) T47D 


11.0 


Heart 


7.1 


Breast ca. BT-549 


2.1 


Skeletal muscle (fetal) 


5.6 


Breast ca. MDA-N j 


0.0 


Skeletal muscle 


4.7 


Ovary 


10.2 


Rnnf marrow 


1.1 


Ovarian ca. OVCAR-3 


1.3 


Thymus 


6.8 


Ovarian ca. OVCAR-4 


1.0 


Spleen 


1.6 i 


Ovarian ca. OVCAR-5 


31.4 


Lymph node 


3.0 


Ovarian ca. OVCAR-8 


0.4 


Colorectal 


100.0 


vjvanan ca. iokvjv-i 


1 A 


Stomach 


1.8 


Ovarian ca.* (ascites) ; 
SK-OV-3 


4.6 


Small intestine 


0.9 


Uterus 


3.4 


Colon ca. SW480 


1.4 


Placenta 


1.8 


Colon ca.* SW620(SW480 
met) 


13.8 


Prostate 


2.0 


Colon ca. HT29 


1.4 ! 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colonca.HCT-116 


0.0 


Testis 


1.8 


Colon ca. CaCo-2 


4.4 


Melanoma Hs688(A).T 


0.9 


Colon ca. 

tissue(OD03866) ; 


4.0 j 


Melanoma* (met) 
Hs688(B).T 


2.2 


Colon ca. HCC-2998 


2.5 


Melanoma UACC-62 : 


1.1 


Gastric ca.* (liver met) 
NCI-N87 


6.4 


Melanoma M14 


0.0 
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Bladder 


1 4.3 jMelanoma LOX IMVI 


12.2 • 


Trachea 


'"• " « Melanoma* (met) SK- ' ■ 
1,5 jMEL-5 


1.4 


Kidney 


1 8.4 jAdipose ' 


■ r 24.8 



Table 42 . Panel 2.2 



i ibsuc retime 


Rel. Exp.(%) Ag2881, 
Run 175119671 


Tissue Name 


ReLExp.(%) Ag2881, 
Run 175119671 


Normal Colon 


17.2 


Kidney Margin (OD04348) 


100,0 




i 24.3 


Kidney malignant cancer . 
(OD06204B) 


32.8 ' ; 


L-Oion Margin {kjuvokjo^) 
i 




Kidney normal adjacent 
tissue (OD06204E) ■ 


23,3 


Colon cancer (OD061 59) * 


1.9 


Kidney Cancer (OD04450- 
01) 


8.4 


Colon Margin (OD06159) 


8.5 


Kidney Margin (OD04450- 
03) 


21.3 ' 


uoion cancer {uuvozy /-u*tj 


7 R 


Kidnev Cancer 8120613 


0.4 


Colon Margin (OD06297- 

01 D) 


24.5 


Kidney Margin 8120614 


14.3 


CC Gr.2 ascend colon 


4.5 


Kidney Cancer 9010320 


2.7 


CC Margin (UDUiVz I ) 




iviuiicy ividigiiJ /ujvjx-i 


4.8 


Colon cancer metastasis 

(UIW0I04J 


4.1 


Kidney Cancer 8120607 


7.6 


Lung Margin (OD06104) 


6.3 


Kidney Margin 8 1 20608 


2.6 


Colon mets to lung 
(OD04451-01) 


' 3.5 


Normal Uterus 


21.6 


Lung Margin (OD0445 1-02) ! 


25.2 


Uterine Cancer 06401 1 


in i 

19.1 


Normal Prostate 


9.9 


Normal Thyroid 


2.0 


Prostate Cancer (OD044 1 0) 


1.4 


Thyroid Cancer 064010 


6.9 


Prostate Margin (OD044 1 0) ! 


11.9 


Thyroid Cancer A302152 


18.0 


Normal Ovary ] 


19.9 


Thyroid Margin A302153 


3.6 | 


Ovarian cancer (OD06283- 
03) 


7.9 


Normal Breast 


29.3 


Ovarian Margin (OD06283- 
07) 


29.1 


Breast Cancer (OD04566) 


3.8 


Ovarian Cancer 064008 


20.7 


Breast Cancer 1024 


12.6 


Ovarian cancer (OD06145) 


1.7 


Breast Cancer (OD04590- 

PU 


35.8 


Ovarian Margin (OD06145) 


21.5 


Breast Cancer Mets 
(OD04590-03) 


31.6 


Ovarian cancer (OD06455- 
03) _ 


4.0 


Breast Cancer Metastasis 
(OD04655-05) 


59.9 


Ovarian Margin (OV06455- 
07) 


17.0 


Breast Cancer 064006 


4.7 


Normal Lung 


17.9 


Breast Cancer 9100266 


20.9 


Invasive poor diff. lung 
adeno (ODO4945-01 


5.8 


Breast Margin 9 1 00265 


12.2 


Lung Margin (ODO4945-03) 


27.9 


Breast Cancer A209073 


6.6 


Lung Malignant Cancer 
(OD03126) 


9.9 


Breast Margin A2090734 


24.8 
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Lung Margin (OD03126) 


13.6 


Breast cancer (OD06083) 


11.5 


Lung Cancer (OD05014A) 


15.7 


TVrR£*ct rsvnrpT Tiocifi 
metastasis (OD06083) 


7.3 


Lung Margin (OD05014B) 


37.1 


Normal Liver 


27.7 


Lung cancer (OD06081) 


11.6 


Liver Cancer 1026 


2.7 


Lung Margin (OD06081) 


12.6 


Liver Cancer 1025 


7.4 


JLung cancer \{jij\JhZj 


Z.J 


T iver Cancer 6004-T 


12.9 


.Lung jviargm ^ujju^zj /-v/j 


J / .0 


T ivpr Tissue 6004-N 

1 -1 '„ 1- L*. 


0.0 


Ocular Melanoma Metastasis 


3.3 


Liver Cancer 6005-T 


2.9 


Ocular Melanoma Margin 
(JLiverj 


14.0 


Liver Tissue 6005-N 


12.2 


Melanoma Metastasis 


, 2.0 


Liver Cancer 064003 . 


4.8 


Melanoma Margin (Lung) 


29.3 


Normal Bladder 


1 O A 

13.4 ( 


Normal Kidney 


9.4 


Bladder Cancer 1023 


4.0 


Kidney Ca, Nuclear grade 2 
(OD04338) 


52.9 


Bladder Cancer A302173 


4.2 


Kidney Margin (OD04338) 


10.6 


Normal Stomach j 


32.3 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


12.5 


Gastric Cancer 9060397 


1.2 


Kidney Margin (OD04339) 


22.2 


Stomach Margin 9060396 


2.5 


Kidney Ca, Clear cell type 
(OD04340) 


19.8 


Gastric Cancer 9060395 


10.4 


Kidney Margin (OD04340) 


20.6 


Stomach Margin 9060394 


13.7 


Kidney Ca, Nuclear grade 3 
(OD04348) 


5.5 


Gastric Cancer 064005 


4.3 



Table 43 . Panel 4D 



Tissue Name 


Rel.Exp.(%) Ag2881, 
Run 164311037 


Tissue Name 


Rel. Exp.(%) Ag2881, 
Run 164311037 


Secondary Thl act 


5.8 


HUVEC IL-lbeta 


9.8 


Secondary Th2 act 


8.1 


HUVEC IFN gamma 


21.2 


Secondary Trl act 


7.5 ! 


HUVEC TNF alpha 4- IFN j 
gamma : 


23.5 


Secondary Thl rest 


2.5 


HUVEC TNF alpha + IL4 


16.3 


Secondary Th2 rest 


3.8 


HUVEC IL- 11 


5.8 


Secondary Trl rest 


3.4 


Lung Microvascular EC none 


11.3 


Primary Thl act 


3.7 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


15.3 


Primary Th2 act 


3.3 


Microvascular Dermal EC none 


23.7 


Primary Trl act 


6.5 


Microsvasular Dermal EC 
TNFalpha* IL-lbeta 


15.7 


Primary Thl rest 


18.3 


Bronchial epithelium TNFalpha ! 
+ ILlbeta 


29.3 


Primary Th2 rest 


11.8 


Small airway epithelium none 


4.3 


Primary Trl rest 


4.3 


Small airway epithelium 
TNFalpha -ML- 1 beta 


100.0 


CD45RA CD4 lymphocyte 
act 


1.1 


Coronery artery SMC rest 


5.5 


CD45RO CD4 lymphocyte 
act 


4.7 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


3.3 


CD8 lymphocyte act 


0.9 


Astrocytes rest 


2.4 
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Secondary CD8 
lymphocyte rest 


O. / 


Astrocytes iiNrarpna ^ iju-iocia 


? 1 


Secondary CD 8 
lymphocyte act 


2.4 , i 


KU-8 12 , (Basophil) rest , 


5.1 


OT)4 Ivmnhnrvte none 


2.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


27.0 


2ryThl/Th2/Trl anti- 
CD9SCH}! 


4.0 


CCD1 106 (Keratinocytes) none 


6.0 


LAK cells rest 


13 


CCD 11 06 (Keratinocytes) 

l iNrarpna ' 1L,- 1 Deia 


9.0 


LAK cells IL-2 


2.0 


Liver cnxhosis 


1 .8 


LAK cells IL-2-HL-I2 


Z.j 


lAipus KJuiiey 


, o.9 


LAK cells IL-2+EFN 
gamma 


'6.2 


NCLH292none 


28.5 

_ — . 1 — 


LAK cells 1L-2+ IL-18 


5.0 




dKj.Kj 


LAK cells 

i lvLru lunLniiyuux 


2.5 


NCLH292 IL-9 j 


34.4 


XTV r^lte TT 9 rpct 

ina. *_.eiis lJL-z resi i 


1.2 


NCI-H292 IL-13 


14.7 


I wo w ay j.vi.i^i\ j nay 


1.3 


NCI-H292 IFN gamma 


24.5 


Two Way MLR 5 day 


1.8 


HPA£C none 


7.6 


Two Way MLR 7 day 


2.0 


HPAJEC TNF alpha + JUL-1 beta . 


1^1 


PBMC rest 


1.2 


Lung fibroblast none 


4.4 


PBMC PWM 


24.5 


Lung fibroblast T>iF alpha + IL- 
1 beta 


2.1 


PBMC PHA-L 


12.2 


Lung fibroblast IL-4 


4.8 


Ramos (B cell) none 


1.6 


Lung fibroblast IL-9 


4.1 


Ramos (B cell) ionomycin 


14.5 


Lung fibroblast IL-13 


2.4 


B lymphocytes PWM 


26.2 


Lung fibroblast IFN gamma 


12.2 


B lymphocytes CD40L 
and IL-4 


4.3 


Dermal fibroblast CCD 1070 rest 


2.3 


EOL-1 dbcAMP 


0.0 


Dermal tibroblast L,cu i u /u 
TNF alpha 


8.6 


POT -1 rfhr A "MP 

1ZKJ±^~ 1 UUWTJV4X 

PMA/ionomycin 


0.1 


Dermal fibroblast CCD 1070 IL- 
1 beta 


0.5 


Dendritic cells none 


0.7 


Dermal fibroblast IFN gamma 


1.2 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


1.9 


Dendritic cells anti~CD40 


0.9 


IBD Colitis 2 


0.2 


Monocytes rest 


0.1 


IBD Crohn's 


0.4 


Monocytes LPS 


0.4 


Colon 


3.5 


Macrophages rest 


0.8 


Lung 


7.6 


Macrophages LPS 


0.2 


Thymus 


14.9 


HUVEC none 


15.1 


Kidney 


21.3 


HUVEC starved 


49.3 







Table 44 . Panel 5 Islet 



|Rel. Exp.(%) 
Tissue Name Ag2881 , Run 
j 233070520 


Rel. Exp.(%) 
Ag2881,Run 
233682397 j 


Tissue Name 


Rel. Exp.(%) 
Ag2831,Run : 
233070520 


Reh Exp.(%) 
Ag2881, Run 
233682397 


97457 JPatient- j sQ Q 
02go_adipose J 


39.8 


94709_Donor 2 AM - A_adipose' 


12.7 


10.7 


97476 JPatient- j 12.6 


15.0 


947 1 OJDonor 2 AM - B_adipose 


7.8 1 


4.1 
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07sk_skeletal ■ 
muscle 


l_l ; 










97477 JPatient- 
07ut_uterus 


24.5 f 


28.7 

i 


9471 l_Donor 2 AM - C_adipose 


2.5 


1.8 


97478_Patient- 
07pl__placenta 


49.0 


51.8 


947i2_Donor 2 AD - A_adipose 


21.0 


15.3 


99167_Bayer 
Patient 1 


100.0 


100.0 


94713„DonoT 2 AD - B^adipose 


13.9 


20.4 


97482_Patient- 
08ut_uterus 


23.2 


20.6 


94714_Donor 2 AD - C_adipose 


21.6 ■ 


15.6 


97483_Patient- 
08pl_placenta 


48.6 


' 40.9 


94742_Donor 3 U - 

A Mesenchymal Stem Cells 


3.0 | 


2.3 


97486_Patient- 
09sk skeletal 1 
muscle 


4.0' 


$.0 


94743 _Donor 3 U - 
B_Mesenchymal Stem Cells 


O.J 


A 1 


97487_Patien|- 
09ut_uterus 


13.9 


41.8 


94730JDonor 3 AM - A_adipose 


15.7 


32.6 


97488JPatient- . 
09pl_placenta 


29.1 


29.5 


94731_Donor 3 AM - B_adipose ; 


8.1 


2.6 


97492_Patient- 
1 Out_uterus 


19.1 


28.1 


94732_Donor 3 AM - C_adipose 


10.4 ! 


6.6 


97493_Patient- 
1 Opl__placenta 


82.9 


94.0 


94733_Donor 3 AD - A^adipose . 


14.3 ! 


18.0 


97495_Patient- j 
llgo_adipose 


41.2 


42.0 


94734_Donor 3 AD - B_adipose 


4.9 


4.8 


97496_Patient- 
1 1 sk_skeletal 
muscle 


18.7 


17.9 


94735 J3onor 3 AD - C_adipose , 


11.7 


13.2 


97497 JPatient- 
1 lut_uterus 


49.3 


49.7 


77 1 38_Liver__HepG2untreated 


46.0 


42.0 


97498 JPatient- 

1 lpljplacenta j 


25.9 


35.1 


73556JHeart_Cardiac stromal j 
cells (primary) 


96.6 


72.2 


97500_Patient- ; 
12go_adipose 


42.0 


39.5 


81735_Small Intestine 


29.5 S 


34.6 


97501_Patient- 
12sk_skeletal j 
muscle 


24.3 


44.4 


72409_Kidney_Proximal 
Convoluted Tubule 


8.8 


4.4 


97502_Patient- 
12ut uterus 


42.0 


39.2 


82685_Small 
mtestineJDuodenum 


1.8 


1.3 


97503_Parient- 
12plj3lacenta 


24.1 


18.3 


90650_Adrenal_AdrenocorticaI i 
adenoma 


16.4 


13.0 


94721_Donor2U- 
AJVlesenchymal 
Stem Cells 


5.1 


8.8 


72410_Kidney„HRCE 


17.2 


17.4 


94722 JDonor 2 U - , 
B Mesenchymal 
Stem Cells 


2.2 


4.0 


724UJCidney_HRE 


25.7 


28.7 


94723_Donor2U- 
C^Mesenchymal 
Stem Cells 


3.5 


4.9 


73139_Uterus__Uterine smooth 
muscle cells 


11.5 


4.5 



CNS^neurodegenerationjvl.O Summary: Ag2881 The results from this experiment 



confirm the expression of this gene at moderate level in the CNS in an independent group of 
patients. However, no differential expression of the CG56089-01 gene is found between 
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Alzheimer's disease and control postmortem brains on this panel. Please see Panel 1 3D for a 
discussion of utility. 

Panel 1.3D Summary: Ag2881 Expression of the CG56089-01 gerie is highest in a 
sample derived from normal colon tissue (CT = 27.3). Thus* expression of this gene could be 
5 used to distinguish colon tissue from the other samples in the panel. In addition, there is 
moderate expression of this gene in a number of lung cancer cell lines (CTs = 29,5-31). 
Therefore, therapeutic modulation of this gene or its protein product, through the use of small 
molecule drugs, antibodies or protein therapeutics, might be of benefit in the treatment of lung 
cancer. This gene is also expressed ,at moderate levels in all CNS regions examined, including 

10 in amygdala, cerebellum, hippocampus, substanti nigra, cerebral cortex, thalamus and spinal 
cord, suggesting a potential role in normal function of the central nervous system. 

Among tissues with metabolic or endocrine function, this gene is expressed at 
moderate levels in adipose, heart and skeletal muscle and at low levels in pancreas, adrenal 
gland, thyroid, pituitary gland, and liver. The CG56089-01 gene is homologous to MAST205, 

15 which was originally identified as microtubule-associated serine/threonine protein kinase from 
mouse testis (ref. 1); the MAST205 protein is also referred to as syntrophin associated kinase. 
MAST205 interacts with b2-syntrophin via PDZ domain interactions (ref. 2) and co-localizes 
at neuromuscular junctions with b2~syntrophim The activity of syntrophin associated S/T 
kinases is regulated by phosphorylation. Therefore, the homologous CG56089-01 protein may 

20 be of utility in the treatment of muscular dystrophy or other neuromuscular disorders. 
Furthermore, the relatively high expression of this gene in adipose and skeletal muscle 
suggests that modulation of the activity of this gene or its protein product, using protein 
therapeutics, antibodies or small molecule drugs, could be of use in the treatment of both 
obesity and type II diabetes (Walden PD, Millette CF. 1996. Increased activity associated with 

25 the MAST205 protein kinase complex during mammalian spermiogenesis. Biol Reprod 
55:1039-44; Lumeng C, Phelps S, Crawford GE, Walden PD, Barald K, Chamberlain JS. 
Interactions between beta 2-syntrophin and a family of microtubule-associated 
serine/threonine kinases. Nat Neurosci 1999 Jul;2(7):61 1-7). 

Panel 2.2 Summary: Ag2881 Expression of the CG56089-01 gene is highest is a 

30 sample derived from normal kidney tissue adjacent to a kidney cancer (CT = 30.3). In 

addition, there is substantial expression in a number of other normal tissues, including two 
samples derived from normal tissue surrounding lung cancers, normal stomach, normal breast 
and normal colon. Thus, the expression of this gene could be used to distinguish these tissues 
from the other tissues on this panel. Moreover, therapeutic modulation of this gene or its 
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protein product, through the use of small molecule drugs, antibodies or protein therapeutics, 
might be of benefit for the treatment of lung cancer, breast cancer, kidney cancer or colon 
cancer. ' 

Panel 4D Summary: Ag2881 The CG56089-01 gene' is expressed at moderate to low 
5 levels across the majority of samples on this panel. However, this gene is highly expressed in 
small airway epithelium treated with the pro-inflammatory cytokines TNF-a and IL-lb (CT = 
26.4), in human umbilical vein epithelial cells (HUVEC), in the muco-epidermoid cell line 
H292, and bronchial epithelium treated with TNF-a and IL-1 . 

The CG56089-01 gene encodes a protein with homology to MAST205, a microtubule- 

10 associated serine/threonine kinase, that has a PDZ domain that allows interaction with beta 2- 
syntrophin. It is possible that this protein can interact with other proteins through its PDZ 
1 domain including, for example, synthenin, a protein that plays a role in cytokine regulation 
(ref. 1). Therefore, modulation of the expression or activity of the CG56089-01 gene product 
by small molecule drugs could be useful to prevent or reduce the symptoms associated with 

1 5 lung inflammatory reactions occurring in chronic obstructive, pulmonary disease, asthma, and 
emphysema. Since this transcript is also found at lower levels in activated T and B cells, 
therapeutic modulation of this gene or its protein product using protein therapeutics, 
monoclonal antibodies or small molecule drugs, could also be used to treat the symptoms 
associated with autoimmune and inflammatory diseases, such as inflammatory bowel disease, 

20 systemic lupus erythematosus, or rheumatoid arthritis (Geijsen N, Uings IJ, Pals C, Armstrong 
J, McKinnon M, Raaijmakers JA, Lammers JW, Koenderman L, Coffer PJ. Cytokine-specific 
transcriptional regulation through an IL-5Ralpha interacting protein. Science 2001 Aug 
10;293(5532):1136-8). 

Panel 5 Islet Summary: Ag2881 The CG56089-01 gene is expressed at low levels 

25 (CTs = 32-35) in the various types of adipose and skeletal muscle collected from the different 
patient and donor groups represented on this panel, further indicating an importance for this 
gene in the regulation of metabolism and energy balance. The gene is also expressed in human 
islets indicating a potential importance in insulin secretion. This gene localizes to the type II 
diabetes QTL at 5ql2.3. When phosphorylated, b2-syntrophin may negatively influence 

30 insulin secretion (See Panel 1 .3D for a description), suggesting that therapeutic modulation of 
the CG56089-OI syntrophin associated kinase-like gene or its protein product, using protein 
therapeutics, antibodies or small molecule drugs, may influence insulin secretion in type II 
. diabetes. 
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F. NOV6: Kiloii-like 

Expression of the NOV6 gene (CG56087-01) was assessed using the primer-probe sets 
Ag2847 and Ag2880,' described in Tables 45-46. Results of the RTQ-PCRruns are shown in 
Tables 47-51.' 
5 Table 45 , Probe Name Ag2847 



Primers 


Sequences ; 


Length 


Start 
Position 


Forward 


5 ' -agggactacagcctccagatac~3 ' (SEQ ID NO:214) 


22 


311 : 


Probe 


TET-5 ' -atggcccatacacgtgttctgttcag-3 • -TAMRA (SEQ 
ID NO: 215) 

1 1- __ 


26 


354 


Reverse 


5' -cattgttctgggtgtatgttga-3 ' (SEQ ID NO;216) 


22 


382 



Table 46 . Probe Name Ag2880 



Primers 


Sequences 

1 


Length 


Start 
Position 


Forward 


5 ' -gctggtaccttgtgttgacact-3 • (S 1 EQ ID NO: 217) 


22 


1011 


Probe 


TET-5 » -ccagcatattctacctgaagaatgcca-3 ' -TAMRA (SEQ 
ID NO:218) 


27 


1044 


Reverse 


5 ' -aaagccttttatgggtctttga-3 ' (SEQ ID N0:219) 


22 j 


1084 



Table 47 . CNS_jieurodegeneration_vl .0 



Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
208699894 


Rel. Exp.(%) \ 
Ag2880,Run 
209058910 


Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
208699894 ! 


Rel. Exp.(%) 
Ag2880, Run 
209058910 


AD 1 Hippo 


5.6 ! 


2.4 ; 


Control (Path). 
3 Temporal Ctx 


1.9 


0.3 


AD 2 Hippo 


13.9 j 


10.5 


Control (Path) 
4 Temporal Ctx 


29.3 


12.8 


AD 3 Hippo 


2.7 


1.1 


AD 1 Occipital 

Ctx , 


11.0 


4.6 


AD 4 Hippo 


3.1 


0.5 


AD 2 Occipital 
Ctx (Missing) 


0.0 


0.0 


AD 5 Hippo 


84.1 


100.0 


AD 3 Occipital 
Ctx 


1.9 


1.0 


AD 6 Hippo 


19.9 


19.8 


AD 4 Occipital 
Ctx 


12.2 


2.7 


Control 2 
Hippo 


15.5 


12.9 


AD 5 Occipital 
Ctx 


39.0 


18.9 


Control 4 
Hippo 


2.1 


1.4 ; 


AD 6 Occipital 
Ctx 


24.0 


46.7 


Control (Path) 
3 Hippo 


1.3 


0.4 


Control 1 
Occipital Ctx 


0.7 


0.3 


AD 1 

Temporal Ctx 


4.6 


2.0 


Control 2 
Occipital Ctx 


50.3 


66.4 


AD 2 

Temporal Ctx 


22.8 


13.9 


Control 3 ; 
Occipital Ctx 


12.6 


3.9 


AD 3 

Temporal Ctx 


2.1 


0.7 


Control 4 
Occipital Ctx 


1.6 


1.0 


AD 4 

Temporal Ctx 


15.0 


4.0 


Control (Path) 
1 Occipital Ctx 


69.3 


93.3 
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AD 5 Inf 
Temporal Ctx 


100.0 


83.8 


Control (Jratii) 
2 Occipital Ctx 


9.8 


3.6 


AL) 5 oup 
Temporal Ctx 


20.3 


16.6 


Control \r atnj . 
3 Occipital Ctx 


0.6. 


0.3 


AJD o lnt i 
Temporal Crx 


24.3 


28.7 


Control (ratli) 
4 Occipital Ctx 


11.2 


4.7 


AX) o oup 
Temporal Ctx 


24.5 


29.3 


t^onrroi i 
Parietal Ctx 


2.5 


0.6 


Control 1 
J emporal Ctx 


1.6 


0.4 ' 


Control 2 
ranetai ctx 


■ 19.3 j 


13.3 


Control 2 
Temporal Ctx 


30.1 


35.6 1 


Control 3 
Parietal Ctx 


i2.6 j 


4.9 


Control 3 
Temporal Ctx 


7.3 


2.6 


Control (Path) j 
1 Parietal Ctx 


• 62.9 


92.7 


Control 3 
Temporal Ctx 


2.6 


■ 1.3 


Control (Path) 
2 Parietal Ctx ; 


15.5 j 


i 

9.5 


Control (Path) 
1 Temporal 
Ctx 


44.1 


54.7 


Control (Path) j 
3 Parietal Ctx ! 


1.2 | 


0.3 


Control (Path) 
2 Temporal 
Ctx 


22.7 


15.7 


Control (Path) ' 
4 Parietal Ctx , 


40.1 


27.0 



Table 48. Panel 1.3D 



Tissue Name 


Rel.Exp.(%) ; 
Ag2847, Run 
161930455 


Rel. Exp.(%) 
Ag2880,Run 
159996472 


Tissue Name 


Re!. Exp.(%) 
Ag2847, Run 
161930455 


Re). Exp.(%) 
Ag2880, Run 
159996472 


Liver 

adenocarcinoma 


» 


0.3 


Kidney (fetal) 


0.9 


1.3 


Pancreas 


0.5 


0.5 


Renal ca. 786-0 ' 


0.2 


0.3 


Pancreatic ca. 
CAP AN 2 


0.0 \ 


0.0 


Renal ca. A498 


0.5 


0.5 


Adrenal gland 


1.1 


1.8 


Renal ca. RXF 
393 


0.0 


0.0 


Thyroid 


1.4 


1.4 


Renal ca. ACHN j 


0.5 


0.3 


Salivary gland 


0.3 


0.3 


Renal ca. UO-31 


4.3 


4.1 


Pituitary gland 


2.6 


6.0 


RenaJ ca. TK-10 i 


0.0 


0.1 


Brain (fetal) 


4.8 


8.7 


Liver 


0.0 


0.1 


Brain (whole) 


19.8 


21.5 


Liver (fetal) 


0.1 


0.1 


Brain (amygdala) 


20.9 


31.9 


Liver ca. 

(hepatoblast) 

HepG2 


0.0 


0.0 


Brain (cerebellum) 


25.0 


15.2 


Lung 


2.2 


4.2 


Brain (hippocampus) 


38.4 


100,0 


Lung (fetal) 


1.2 


0.4 


Brain (substantia 
nigra) 


3.6 


4.0 


Lung ca. (small 
cell) LX-1 


0.0 


0.0 


Brain (thalamus) 


5.9 


9.2 


Lung ca. (small ' 
cell)NCJ-H69 


0.3 


0.6 


Cerebral Cortex 


100.0 


40.3 


Lung ca. (s.cell 
var.) SHP-77 


1.6 


1.3 


Spinal cord 


11.1 


2.7 


Lung ca. (large 
cell)NCLH460 ; 


0.2 


0.1 


glio/astro U87-MG i 


1-3 


0.3 


Lung ca. (non- 


0.0 


0.0 
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sm. cell) A549 






glio/astro U-118- 
MG 


9.7 


20.7 


Lung ca. (non- 
s.cell) NCI-H23 


0.1 


0.0 


astrocytoma 
SW1783 ' 


2.9 


1.5 


Lung'ca. (non- 
s.cell)HOP-62 


2.0 


1 .3- 


neuro*; met SK-N- 
AS 


0.5 


2.4 


Lung ca., (non- 
s.cl) NCI-H522 


O.l 


0.0 


uo li y njxxia o J ~> ~j j 


0.8 


0.4 


Lung ca. 

(squam.) SW 900 


0.0 


0.0 


astrocytoma SNB-75; 


0.2 


0.1 


Lung ca. , 
(sq'uam.) NC1- 
H596 


0.3 


0.2 


gIiomaSNB-I9 | 


r 0.2 


0.2 


Mammary gland 


1.4 


3.8 




0.8 


0.4 


Breast ca.* 
(pl.ef)MCF-7 


0.0 


0.0 i 


glioma SF-295 


0.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 
MB-231 


0.7 


3.8 


Heart (fetal) 


5.4 


1.0 


Breast ca.* 
(pi. el) T47D 


0,0 


0.0 


Heart 


4.7 


1.9 


Breast ca. BT- 
549 


0.2 


2.0 


Skeletal muscle 
(fetal) 


22.7 


6.8 


Breast ca. MDA- < 
N 


0.0 


0.0 


Skeletal muscle 


3.8 


0.6 


Ovary 


5.2 


1.4 


Bone marrow 


0.3 ' 


0.6 


Ovarian ca. 
OVCAR-3 


0.9 


l.O 


Thymus 






Ovarian ca. 
OVCAR-4 


0.0 


0.0 


Spleen 


0.1 


0.1 


Ovarian ca. 
OVCAK.-5 


0.0 


0.0 


Lymph node 


0.7 


0.6 


Ovarian ca. 
OVCAR-8 


1.3 


l.l 


Colorectal 


9.0 


1.7 


Ovarian ca. 
JGROV-l 


0.0 


0.0 


Stomach 


2.2 


4.0 


Ovarian ca.* ■ 
(ascites) SK-OV-! 

3 


0.0 


0.3 


Small intestine i 


5.6 


7.1 


Uterus | 


2.2 


2.5 


Colon ca. SW480 


0.0 


0.0 


Placenta 


0.5 


0.6 


Colon ca.* 
SW620(SW480 met) 


0.0 i 


0.0 


Prostate 


0.7 


0.3 


Colon ca. HT29 ; 


0.0 j 


0.0 


Prostate ca.* 


1.4 


1. 7 


Colon ca.HCT-1 16 


0.0 


0.0 


Testis 


1. 7 


L2 


Colon ca. CaCo-2 


0.0 


0.0 


Melanoma 
Hs688(A).T 


5.4 


2.4 


Colon ca. 
tissue(OD03866) 


2.5 


0.9 


Melanoma* 
(met) 

Hs688(B).T 


5.5 


2.3 


Colon ca. HCC-2998! 


0.0 


0.0 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver = 


0.0 


0.4 


Melanoma Ml 4 


0.0 


0.4 
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met) NCI-N87 • 












Bladder 


H3 " : 


0.3 


Melanoma LOX 
IMVI 


0.0 


0.1 


Trachea 


3.9 


4.0 , , 


Melanoma* 
(met) SK-MEL-5 


, r 

0.1 


0.0 


Kidney 


2.3 


0,4 


Adipose 


8.6 


3.2 



Table 49 . Panel 2D 



Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
1 61 930456 


Rel. Exp.(%) 
Ag2880,Run 


Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 


Rel. Exp.(%) 
Ag28&0, Run 


Normal Colon 


' 100.0 


lOO.O 


Kidney Margin 
812060b 


7.2 


2.3 


CC Well to Mod 
Diff(OD03S66) 


. 2.4 


4.2 


Kidney Cancer 
8120613 


0.0 


0.0 


CC Margin . 
(OD03866) 


19.1 


26.6 


Kidney Margin 
8120614 


10.7 


3.0 


CC Gr.2 

rectosigmoid 

(OD03868) 


2.7 


3.4 


Kidney Cancer 
9010320 


1.3 


0.7 


CC Margin 
(OD03868) 


15.4 


i ■ 

17.2 


Kidney Margin 
9010321 


9.6 


5.1 


CC Mod Diff 
(ODO3920) 


1.6 


0.7, 


Normal Uterus 


17.9 


12.9 


CC Margin 
(ODO3920) 


29.5 i 


23.0 


Uterus Cancer 
06401 1 


43.2 


22.7 


CC Gr.2 ascend 
colon (OD03921) \ 


17.6 


17.9 | 


Normal Thyroid 


7.4 


12.6 


CC Margin 
(OD03921) 


18.2 


22.4 


Thyroid Cancer 
064010 


2.0 ;' 


l.O 


CC from Partial ; 
Hepatectomy 
(ODO4309) Mets 


0.3 


l.O 


Thyroid Cancer 1 
A302152 


1.2 


1.8 


Liver Margin 
(ODO4309) 


0.1 


J. I 


Thyroid Margin 
A302153 ~ . 


16.0 


17.6 


Colon mets to lung 
fOD04451-0n 


2.0 


0.8 


Normal Breast 


16.7 


13.2 


Lung Margin 
(OD04451-02) 


3.8 


4.3 


Breast Cancer 
(OD04566) 


22.5 i 


1 1,2 


Normal Prostate I 
6546-1 


1.8 


3.9 


Breast Cancer 
(OD045 90-01) 


9.2 


8.9 


Prostate Cancer \ 
(OD04410) 


14.3 


19.3 


Breast Cancer 
Mets (OD04590- 
03) 


26.4 


20.6 


Prostate Margin 
(OD04410) 


19.9 


16.7 


Breast Cancer ; 
Metastasis ' i 
(OD04655-05) 


2.9 


4.6 


Prostate Cancer 
(OD04720-01) 


17.2 


17.4 


Breast Cancer 
064006 


5.3 : 


8.4 


Prostate Margin 
(OD04720-02) 


22.2 


29. 1 \ 


Breast Cancer 
1024 : 


8.5 j 


6.7 


Normal Lung 
061010 


35.4 


43.2 ; 


Breast Cancer 
9100266 


7.9 


8.7 


Lung Met to Muscle 


14.2 


13.0 


Breast Margin 


7.9 


5.6 
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(OD04286) • 




|9 100265 






Muscle Margin 
(OD04286) 


H.9 


" 1? R • (Breast Cancer 
12,15 JA209073 


10.9 


13.7 


Lung Malignant 
Cancer (OD03126) 


6.6 


jBreast Margin 
L5 " JA2090734 


2.5 ' 


4.1 


Lung Margin 
(OD03126) 


10.3 


10.2 ||Normal Liver 


0.7 


0.7 


Lung Cancer 
(OD04404) 


2.6 


llLiver Cancer 
iA (064003 


0.1 


0.2 


Lung Margin 
(OD04404) 


12.9 » 


jLiver 'Cancer 
i2U j] 1025 


0.3 


0.1 , 


Lung Cancer 
(OD04565) 


0.9 


2.0 


Liver Cancer 
1026 


0.2 


0.1 


Lung Margin 
(OD04565) 


. 4.0 


2.1 


Liver Cancer 
6004-T 


0.1 


0.1 


Lung Cancer ■ 
(OD04237-01) 


2.7 


3.7 ' 


Liver Tissue 
6004-N 


0.2 


0.3 


Lung Margin 
(OD04237-02) 


17.6 


16.8 


Liver Cancer 
6005-T 


' 0.5 


0.2 


Ocular Mel Met to 
Liver (ODO4310) 


0.1 


i ' 0.3 


Liver Tissue 
6005-N 


0.1 


0.0 


Liver Margin ■ 
(ODO4310) 


0.2 


0.0 


Normal Bladder 


10.1 


17.1 


Melanoma Mets to 
Lung (OD04321) 


7.8 


7.2 


Bladder Cancer 
1023 


1.8 


1.3 


Lung Margin 
(OD04321) 


3L0 


23.5 


Bladder Cancer 
A302173 


3.9 


5.4 


Normal Kidney 


41.8 


58.2 ' 1 


Bladder Cancer 
(OD047 18-01) 


4.4 


2.2 


Kidney Ca, Nuclear 
grade 2 (OD04338) 


3.4 


7.9 


Bladder Normal ; 
Adjacent 
(OD047 18-03) 


97.9 ' 


88.3 


Kidney Margin E 
(OD04338) i 


22.4 


14.3 i 


Normal Ovary , 


7.2 


4.0 


Kidney Ca Nuclear [ 
grade 1/2 
(OD04339) 


0.7 \ 


3.5 


Ovarian Cancer 
064008 


14.0 


14.8 


Kidney Margin ' 
(OD04339) 


17.3 i 


15.3 


Ovarian Cancer 
(OD047 68-07) 


0.1 


0.4 


Kidney Ca, Clear 
cell type (OD04340) 


1.3 


4.0 


Ovary Margin 
(OD04768-08) 


6.4 


7.6 


Kidney Margin 
(OD04340) 


35.4 


40.1 | 


Normal Stomach 


40.6 


46.3 


Kidney Ca, Nuclear 
grade 3 (OD04348^ 


0.6 


0.2 j 


Gastric Cancer 
9060358 


10.8 


9.0 


Kidney Margin ■ 
(OD04348) 


10.0 


11.2 ' 


Stomach Margin ! 
9060359 


9.8 


11.3 


Kidney Cancer 
(OD04 622-01) 


2.0 


0.9 


Gastric Cancer ; 
9060395 


26.6 


36.1 


Kidney Margin 
(OD04622-03) \ 


2.4 


3.1 


Stomach Margin 
9060394 


14.2 


14.7 


Kidney Cancer 
(OD04450-01) 


0.0 


2.1 


Gastric Cancer 
9060397 


7.5 


7.9 


Kidney Margin 


19.8 


13.0 


Stomach Margin 


5.1 j 3.8 
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(OD04450-03) 




(9060396 


i 


Kidney Cancer 
8120607 


2.0 


1 s j Gastric Cancer 
J064005 


21.0 


22.4 



Table 50 , Panel 4D 



Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
159841938 


Rel ExD.f%i 
Ag2880, Run 
159996551 


Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
159841938 


Rel. Exp.(%) 
Ag2880, Run 
159996551 


Secondary Thl act 


0.0 


0.2 , 


HIJVEC IL-lbeta 


3.3 


16.0 


Secondary Th2 act 


0.1 


0.0 


HUVEC IFN gamma 


3.8 


17.8 


Secondary Trl act 


0.0 


0.0 


rlUVbC INr alpha + 
IFN gamma 


3.5 


13.9 


Secondary Thl rest 


0.0 


0.0 


HUVbC INF alpha + 
IL4 


5.1 


15.8 


Secondary Th2 rest 


0.0 


0.0 


HUVEC IL-1 1 


1.8 


5.3 


Secondary Trl rest 


0.0 


0.0 


Lung Microvascular 
EC none 


1.8 


6.6 


Primary Thl act 


0.0 


0.0 


Lung Microvascular 
EC TNFalpha + IL- 
lbeta 


2.1 


12.1 


Primary Th2 act 


0.3 


0.6 ! 


Microvascular 
Dermal EC none 


1.6 


4.6 


Primary Trl act 


0.0 


1.2 


Microsvasular Dermal 
EC TNFalpha + IL- ; 

1 Upfn 

1 DCla 


0.7 


1.3 


Primary Thl rest 


0.0 


0.6 


Bronchial epithelium 


0.0 


2.7 


Primary Th2 rest 


100.0 


0.4 


Small airway 
epithelium none 


0.1 


0.7 


Primary Trl rest 


0.0 


0.0 


Small airway 
epithelium TNFalpha ; 
+ IL-lbeta 


0.0 


1.6 


CD45RA CD4 
lymphocyte act 


2,0 


21.5 


Coronery artery SMC ; 
rest 


7.4 


53.2 


CD45RO CD4 
lymphocyte act 


0.0 


0.7 


Coronery artery SMC 
TNFalpha + IL-lbeta 


6.1 


27.5 


CD 8 lymphocyte act ! 


0.1 


0.2 


Astrocytes rest 


1.4 


5.7 


Secondary CDo i 
lymphocyte rest 


0.0 


0.5 


Astrocytes UNr alpha 
+ IL-lbeta 


0.7 


5.8 


occuiiuai y K^iso 

lymphocyte act 


0.0 


0.0 


KU-812 fBasonhiU 
rest 


0.0 ' 


0.7 


CD4 lymphocyte 
none 


0.0 


0.0 


KU-812 (Basophil) I 
PMA/ionomycin 


0.0 


0.5 


2ry 

Thl/Th2/Trl anti- ; 
CD95CH11 


0.0 


0.0 


CCD 1106 

(Keratinocytes) none 


0.0 


0.0 


LAJC cells rest 


0.0 


0.0 


CCD1106 
(Keratinocytes) 
TNFalpha + IL-lbeta 


0.0 


0.0 


LAK cells IL-2 


0.0 


0.3 


Liver cirrhosis 


0.2 


1.3 


LAK cells IL-2+IL- 
12 


0.0 


0.2 


Lupus kidney 


1.5 


3.5 


LAK cells IL-2+IFN 
gamma 


0.0 


0.2 


NCI-H292 none 


0.5 


2.4 
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LAK cells IL-2+ IL- 
18 


0.0 


0.2 


NCI-H292 IL-4 


1.5 


7.5 

■ — ~ 


LAK cells 
PMA/ionomycin 


0.0 


0,0 


NCLH292 IL-9 , 


.1.5 


9.3 


NK Cells IL-2 rest 


0.0 


0.0 


NCI-H292 IL-13 


0.9 


3.5 


Two Way MLR 3 
day 


0.0 


0.7 


NCI-H292 IFN 


0.7 


2.4 


Two Way MLR 5 
day 


0.0 


0.4 


HPAEC none 


3.8 


16.5 


Two Way MLR 7 
day 


0.0 


0.1 


HPAEC TNF alpha + 

IT 1 Wtfl 
1Lj-i UCla 


2.7 


17.4 


PBMC rest 


0.0 


0.0 


Lung fibroblast none \ 


1.2 


7.3 


PBMC PWM j 


OA 


0.1 


Lung fibroblast TNF 
alpha + IL- 1 beta 


1.1 


3.2 


- — "~ 

PBMC PHA-L 


0.1 


0.0 


Lung fibroblast IL-4 


2.0 


14.6 


Ramos (B cell) none 


1.7 


7.0 


Lung fibroblast IL-9 


3.0 


15.3 


Ramos (B cell) 
ionomycin 


25.3 


100.0 


Lung fibroblast IL-13 


1.2 


8.4 


B lymphocytes PWM 


0.0 


0.4 


Lung fibroblast IFN 
gamma 


2.5 


15.2 


B lymphocytes 
CD40L and IL-4 


0.0 


0.8 


Dermal fibroblast 
CCD 1070 rest 


12.6 


89.5 


FOT -1 dbrAMP 

y.j V/ \ j y KJ U\^J YJv.il 


0,0 


0.0 


Dermal fibroblast 
CCD 1070 TNF alpha 


12.1 


88.3 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


0.2 


Dermal fibroblast 
CUD10/U JLL-J beta 


6.3 


51.4 


Dendritic cells none 


0.0 


0.0 


Dermal fibroblast IFN 
gamma 


3.5 


20.7 


Dendritic cells LPS 


0.0 


0.1 


Dermal fibroblast IL- 
4 


6.3 ; 


35.6 


Dendritic cells anti- ■ 
CD40 


0.0 


0.6 


IBD Colitis 2 


0.7 


5.0 


Monocytes rest 


0.4 


1.7 


EBD Crohn's 


0.9 


6.1 


Monocytes LPS 


0.0 


0.0 


Colon 


5.0 


42.0 


Macrophages rest 


0.0 


0.0 


Lung 


2.6 


16.5 


Macrophages LPS 


0.0 


0.0 


Thymus 


4.7 


44.4 


HUVEC none 


7.2 


33.7 


Kidney 


3.5 


_ 19.2_ 


HUVEC starved 


11.3 


70.7 









Table 51 . Panel CNS J 



Tissue Name 


Rel.Exp.(%) 
Ag2847, Run ! 
171669934 


Reh Exp.(%) j 
Ag2880, Run \ 
171688447 


Tissue Name 


Rel. Exp.(%) 
Ag2847, Run 
171669934 


Rel. Exp.(%) 
Ag2880, Run 
171688447 


BA4 Control 


31.2 


13.0 


BA17 PSP 


34.2 


8.7 


BA4 Control2 


61.6 


58.6 


BA17PSP2 


9.6 


2.6 


BA4 

Alzheimer's2 


6.3 


0.7 


Sub Nigra 
Control 


12.2 


5.3 


BA4 

Parkinson's 


36.6 


15.9 


Sub Nigra 
Control2 


19.6 


19.6 


BA4 

Parlonson , s2 


68.8 


60.3 


Sub Nigra 
Alzheimer's2 


5.9 


1.2 


BA4 


31.6 


30.1 


Sub Nigra 


24.0 


17.7 
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Huntington's 1 






Parkinson's2 






BA4 

Huntington's2 


4.6 M 


0.0 


Sub" Nigra 
Huntington's 


31.9 


14.6 


BA4 PSP 


7.7 


i 

1.1 


Sub Nigra ' 
Huntington's2 


11.2 


. 7.3 


BA4 PSP2 


27.7 


9.7 


Sub Nigra PSP2 


6.0 


2.5 


BA4 

Depression 


14.5 


6.3 


Sub Nigra 
Depression 


2.5 


3.2 


BA4 

Depression2 


8.3 


0.0 


Sub Nigra 
Depressipn2 


3.7 


1.2 


BA7 Control 


53.2 


— , ,„ — — — — - 

23.3 


Glob Palladus 
Control 


5.1 


2.6 


BA7 Control 


33.7' 


, 25.3 


Glob Palladus 
Control 


6.1 


2.5 


BA7 

Alzheimer's2 ' 


11.6 


2.3 


Glob Palladus 
Alzheimer's 


4.5 


2.1 


BA7 

Parkinson's 


16.6 


5.3 


Glob Palladus 
Alzheimer's2 


1.7 | 

i 


1.1 


BA7 

Parkinson*s2 


51.4 


45.i j 


Glob Palladus 
Parkinson's 


36.6 


25.5 


BA7 

Huntington's 


42.3 


22.8 


GJob Palladus 
Parkinson's2 


5.3 


1.2 


RA7 

Huntington's2 


50.7 


14.0 ■ 


Glob Palladus 
PSP 


1.7 


0.6 


BA7 PSP 


43.8 


21.8 


Glob Palladus 
PSP2 


3.9 


0.0 


BA7 PSP2 


36.3 


19.5 


Glob Palladus 
Depression 


2.0 


0.4 


BA7 

Depression 


12.7 | 


1.2 ; 


Temp Pole 
Control 


17.1 


9.3 


BA9 Control ■ 


25.0 | 


9.7 j 


Temp Pole 
Control2 


69.7 


57.8 


BA9 Control 


100.0 j 


100.0 \ 


Temp Pole 
Alzheimer's 


7.9 


0.3 


BA9 

Alzheimer's 


8.1 


1.1 i 


Temp Pole 
Alzheimer's2 


5.3 


1.3 


BA9 

Alzheimer's2 j 


18.3 


3.5 


Temp Pole 
Parkinson's 


30.6 


13.6 


BA9 

Parkinson's 


37.1 


13.5 


Temp Pole 
Parkinson's2 


29.3 


11.3 


BA9 

Parkinson's2 | 


63.3 


55.1 


Temp Pole 
Huntington's 


43.2 


18.3 


BA9 

Huntington's 


55.1 


32.3 


Temp Pole PSP 


7.0 


0.5 


BA9 

Huntington's2 


12.2 


0.9 


Temp Pole PSP2 


8.6 


0.5 


BA9 PSP 


15.2 , 


4.3 


Temp Pole 
Depression2 


4.5 


0.7 


BA9 PSP2 


7.2 


2.5 


Cing Gyr Control 


73.2 


40.6 


BA9 1 
Depression 


3.5 j 


3.1 


Cing Gyr 
Contro32 


38.2 


17.4 


BA9 1 
Depressi on2 


7.9 


1.6 


Cing Gyr 
Alzheimer's 


25.5 


8.8 
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BA17 Control 


59.0 


19.6 


Cing Gyr 
Alzheimer's2 


9.5 


1.4 


BA17 Control2 


67.8 


39.5* 


Cing Gyr 
Parkinson's 


, • 24.3' 


9.3 


BA17 

Alzheimer's2 


16.8 


1.8 


Cing Gyr 
Parkinson's2 


34.4 


32.8 


BA17 
Parkinson's 


37.4 


9.1 


Cing Gyr 
Huntington's 


63.7 


38.4 


BA17 

Parkinson's2 


56.6 


36.6 


Cing Gyr 
Huntington's2 


' 12.5 

i 


4.6 


BA17 

Huntington's 


37.1 


16.3 


Cing Gyr PSP 


15.5 


7.5 


BA17 

Huntington's2 


15.9 


5.1 


Cing Gyr PSP2 


5.8 


0.0 

_i — 


BA17 
Depression 


6.1 


' 0.0 


Cing Gyr 
Depression 


2.9 


0.0 


BA17 

Depression2 


28.1 


7.1 


Cing Gyr 
Depression2 


9.2 


1.8 



CNSjneurodegeneratioiijv'1.0 Summary: Ag2 847/28 80 The results from this 
experiment confirm the expression of this gene at moderate level in the CNS in an independent 
group of patients. However, no clear relationship between the expression levels of the 
CG56087-01 gene and Alzheimer's disease is evident. Please see Panel 1.3D for discussion of 



5 utility of this gene in the central nervous system. 

Panel 3 .3D Summary: Ag2 847/2880 Two experiments with two different probe and 
primer sets show highest expression of the CG56087-01 gene, a kilon homolog, in the brain. 
This gene is expressed at high to moderate levels throughout the central nervous system, 
including in amygdala, cerebellum, hippocampus, substantia nigra, thalamus, cerebral cortex 

10 and spinal cord. This expression profile is consistent with published reports of kilon ' 

expression. The sequence of kilon shows a high degree of homology to that of the chicken 
protein neurotractin, a molecule involved in neurite outgrowth capable of interacting with 
LAMP. Because this class of molecules is thought to play a role in the guidance of growing 
axons and kilon is expressed specifically in neurons, it has been suggested that they confer the 

1 5 ability to rearrange dendritic connectivity on magnocellular neurons. Degeneration of dendritic 
orphology and connectivity is a pathological characteristic of neurodegenerative diseases, such 
as Alzheimer's disease. Recombinant neurotractin promotes neurite outgrowth of telencephalic 
neurons and interacts with the IgSF members CEPU-1. Therefore, the CG56087-01 gene 
product may be used as a protein therapeutic to counter neurodegeneration in a variety of 

20 neurodegenerative diseases. 

hi contrast, expression of the CG56087-01 gene is relatively low in brain cancer 

derived cell lines when compared to its levels in the normal brain. Thus, expression of this 

gene could be used to distinguish brain-derived tissue from other tissues in the panel. 
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Moreover, therepeutic modulation of this gene or its protein product, through the use of small 
molecule drugs, antibodies or protein therapeutics, might be of use in the treatment of brain 
cancer. 1 ' 

This gene is also moderately expressed in a variety of metabolic and tissues including 
5 pancreas, adrenal gland, thyroid, pituitary gland, adult and fetal heart, and adipose. Thus, this 
gene may be an antibody target for the treatment of diseases in these tissues including Types 1 
and 2 diabetes, cardiovascular disease and obesity (Marg A, Sirim P, Spaltmann F, Plagge, A, 
Kauselmann G, Buck F, Rathjen FG, Brummendorf T. Neurotractin, a novel neurite 
outgrowth-pro'moting Ig-like protein that interacts with CEPU-1 and LAMP. J Cell Biol 1999 
] 0 May 1 7;145(4):865-76; Funatsu N, Miyata S, Kumanogoh H, Shigeta M, Hamada K, Endo Y, 
Sokawa Y, Maekawa S. Characterization of a novel rat brain glycosylphosphatidylinositc-1- 
anchored protein (Kilon), a member of the IgLON cell adhesion molecule family. J Biol Chem 

1999 Mar 19;274(12):8224-30; Brauer AU, Savaskan NE, Plaschke M, Prehn S, Ninnemann 
O, Nitsch R. IG-molecule Kilon shows differential expression patltern from LAMP in the 

1 5 developing and adult rat hippocampus. Hippocampus 2000; 1 0(6):632-44; Lodge AP, Howard 
MR, McNamee CJ, Moss DJ. Co-localisation, heterophils interactions and regulated 
expression of IgLON family proteins in the chick nervous system. Brain Res Mol Brain Res 

2000 Oct 20;82(L2):84-94; Miyata S, Funatsu N, Matsunaga W, Kiyohara T, Sokawa Y, 
Maekawa S. Expression of the IgLON cell adhesion molecules Kilon and OBCAM in 

20 hypothalamic magnocellular neurons. J Comp Neurol 2000 Aug 14;424(l):74-85). 

Panel 2D Summary; Ag2847/2880 Two experiments with different probe and primer 
sets produced results that are in very good agreement, with highest expression of the 
CG56087-01 gene in a sample derived from normal colon tissue (CTs=27-29). In addition, 
there is substantial expression of this gene in samples derived from normal colon tissue when 

25 compared to their adjacent malignant counterparts. The trend toward differential expression in 
normal tissues over their malignant counterparts is also seen in kidney samples and bladder 
samples. Thus, the expression of this gene could be used to distinguish normal colon, bladder 
or kidney from their malignant counterparts. Moreover, therapeutic modulation of this gene or 
its protein product, through the use of small molecule drugs, antibodies or protein therapeutics, 

30 might be of use in the treatment of colon, bladder or kidney cancer. 

Panel 4D Summary: Ag2880 The CG56087-01 transcript is expressed in endothelial 
cells, fibroblasts, activated Ramos B cells and activated CD45RA (naive) T cells but not in 
primary B cells. This gene encodes a putative adhesion molecule that has been hypothesized to 
be involved in the establishment and remodeling of neural circuits. The role of this protein in 
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the immune system has not been examined, however, based on its CNS function it may be 

»» "" " 1 ' ■ ' 

involved in cell-cell binding that leads to leukocyte interactions with endothelium resulting in 

leukocyte extravasastion. Alternatively, the protein encoded for by this transcript may be 

important in other cellular interactions. Therefore, therapeutics designed with the protein 

encoded for by this transcript could be important in the treatment of inflammation resulting 

horn asthma, chronic obstructive pulmonary disease, inflammatory bowel disease, arthritis, 

and psoriasis. Please note that data from a 'second experiment using the probe and primer $et 

Ag2847 are not included because the amp plot suggests that there were experimental 

difficulties with this run, 

Panel CNS_1 Summary: A £2847/2880 Two experiments with different probe and 
primer sets produce results that are in very good agreement, confirming expression of the 
CG56087-01 gene in the brain. . Please see Panel 1 .3D for discussion of utility of this gene in 
the central nervous system. » • 
G. NOV7: MIX£D LINEAGE KINASE 2-Hke 

Expression of the NOV7 gene (CG56071-01) was assessed using the primer-probe sets 
Ag2872 and Ag4847, described in Tables 52-53. Results of the RTQ-PCR runs are shown in 
Tables 54-63. 

Table 52 . Probe Name Ag2872 



Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 ■ -tcagccagaccatagagaatgt-3 ' {SEQ ID NO;220) 


22 ... 


506 


Probe 


TET-5 ■ -atgctgaagcaccccaacatcatt'g-3 ' -TAMI^A (SEQ ID, 
NO:221) 


» 


553 


Reverse 


5 1 -ctccttcagacatacccctctt-3 ' {SEQ ID NO:222) 


22 


582 



Table 53 . Probe Name Ag4847 



Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 ' -catagagaatgttcgccaagag-3 1 (SEQ ID NO:223) 


22 


516 


Probe 


TET-5 1 -atgctgaagcaccccaacatcattg-3 ' -TAMRA (SEQ ID 
NO: 224) 


25 


553 


Reverse 


5 ' -ctccttcagacatacccctctt-3 " (SEQ ID NO: 225) 


22 


582 



Table 54 . CNS_neurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2872, Run 
209779299 


Tissue Name 


Rel. Exp.(%) Ag2872, Run 
209779299 


AD 1 Hippo 


7.4 | 


Control (Path) 3 
Temporal Ctx 


1.3 


AD 2 Hippo 


16.5 


Control (Path) 4 
Temporal Ctx 


24.5 


AD 3 Hippo 


5.3 


AD 1 Occipital Ctx 


15.8 


AD 4 Hippo 


5.3 


AD 2 Occipital Ctx 
(Missing) 


0.0 
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AD 5 Hippo 


100.0 


AD 3 Occipital Ctx 


3.3 


AD £% T-Tirmn 


29.7 


AD 4 Occipital Ctx 


20.7 


Control 2 Hippo 


31.9 


AD 5 Occipital Ctx 


52.5 


Control 4 Hippo 


4.6 


AD 6 Occipital Ctx 


31.2 


Control (Path) 3 Hippo 


2.9 


Control 1 Occipital Ctx 


0.9 


AD I Temporal Ctx 


9.5 


Control 2 Occipital Ctx 


79.6 


AD 2 Temporal Ctx 


17.2 


Control 3 Occipital Ctx 


17.8 


AD 3 Temporal Ctx 


4.5 


Control 4 Occipital Ctx 


2.1 


AD 4 Temporal Ctx 


15.0 


Control (Path) 1 
Occipital Ctx 1 


71.2 


AD S Tnf Tpmnnral pt-v 


81.2 


Control (Path) 2 
Occipital Ctx 


10,2 


AD 5 Sup Temporal 
Qtx 




Control (Path) 3 
Occipital Ctx 


0.6 

i 


/ vl_J I CIIipUl til I A 


24.7 


Control (Path) 4 
Occipital Ctx 


16.6 


AD 6 Sup Temporal 
Ctx 




Pnnrrnl 1 Parietal Ctx 
v^yjiiiiyji i * uncial v j iA 


2.6 


Contr ol 1 Temporal Ctx : 


3.0 


Control 2 Parietal Ctx ! 


24.7 


Control 2 Temporal Ctx 


40.9 


Control 3 Parietal Ctx 


17.0 


Control 3 Temporal Ctx ; 


14.9 


Control (Path) 1 
Parietal Ctx 


76.3 


Control 3 Temporal Ctx i 


4.9 


Control (Path) 2' 
Parietal Ctx 


20.2 


Control (Path) 1 
Temporal Ctx 


63.7 : 


Control (Path) 3 
Parietal Ctx 


2.2 


Control (Path) 2 
Temporal Ctx 


35.4 


Control (Path) 4 
Parietal Ctx 


33.4 


Table 55. General screening panel vl.5 


Tissue Name 


Rel. Exp.(%) Ag4847, Run 
228796410 


Tissue Name 


Rel. Exp.(%) Ag4847, Run 
228796410 


Adipose 


1.0 


Renal ca. TK-10 


13.9 


Melanoma* Hs688(A).T 


0.1 


Bladder 


5.8 


Melanoma* Hs688(B).T 


0.0 


Gastric ca. (liver met.) 
NCI-N87 


21.5 


Melanoma* M14 


5.4 


Gastric ca. KATO III 


14.7 


Melanoma* LOXIMVI 


1.6 


Colon ca. S W-948 


4.3 


Melanoma* SK-MEL-5 


9.2 


Colon ca. SW480 


16.2 


Squamous cell 
carcinoma SCC-4 


28.9 


Colon ca.*(SW4 80 met) : 
SW620 


5.4 


Testis Pool 


2.8 


Colon ca. HT29 


4.8 


Prostate ca.* (bone met) 
PC-3 


7.2 


Colon ca.HCT-1 16 


13.4 


Prostate Pool 


1.5 


Colon ca. CaCo-2 


13.6 


Placenta 


5.3 


Colon cancer tissue 


3.0 


Uterus Pool 


1.0 


Colon ca. SW1116 


2.3 


Ovarian ca. OVCAR-3 


10.1 


Colon ca. Colo-205 


2.7 


Ovarian ca. SK-OV-3 


23.5 


Colon ca. SW-48 


2.1 


Ovarian ca. OVCAR-4 


7.1 


Colon Pool 




0.6 


Ovarian ca. OVCAR-5 


24.0 jSmall Intestine Pool 


1.3 



282 



BNSDOCID; <WO_02055704A2_I_> 



WO 02/055704 



PCT/US02/00554 



■I 



Ovarian ca. IGROV-1 


29.5 


Stomach Pool 


1.3 


Ovarian ca. OVCAR-8 


3.7 


Bone Marrow Pool 


0.9 


Ovary 


1.0 


Fetal Heart 


0,9 


Breast ca. MCF-7 


10.5 


Heart Pool 


0.3 


"Rrpa<:t ra 1UDA-MR- 

231 


10.9 


Lymph Node Pool' 


0.9 




0 4 1 


Fetal Skeletal Mmcle 


0.1 


Rrea«;t ra T47T) 


1 3.7 


Skeletal Muscle Pool 


2.9 


JDICaal La. IVJJL'/Ti-j.N 




^rtlppn Pnnl 


2.4 


JDJ CaM j UU1 


1 1 

1 . J 


x xiy Julio i uui 


2.5 


Trachea 


3.9 

— ; _ (- 


f^TsJS ratirpr ( oKn/a^frrA 

U87-MG 


13.3 


Idling 


0.3 ; 


CNS cancer (glio/astro) U- 
118-MG 


0.5 , 


Fetal Lung 




CNS cancer (neuro;met) 
SK-N-AS 


3.7 


Lung ca. NCI-N4 3 7 


2.3 


CNS cancer (astro) SF-539 1 


0.8 


Lung ca. LX-1 


8.8 


CNS cancer (astro) SNB-75 


0.2 


Lung ca. NCI-H146 


5.6 


CNS cancer (glio) SNB-19 


4.0 


Lung ca. SHP-77 


13.1 


CNS cancer (glio) SF-295 


2.8 


Lung ca. A549 


13.4 


Brain (Amygdala) Pool 


8.2 


Lung ca. NCI-H526 


4.0 


Brain (cerebellum) 


100.0 


Lung ca. NCLH23 


7.3 


Brain (fetal) 


15.9 


Lung ca. NCI-H460 


1.1 


Brain (Hippocampus) Pool j 


8.1 


Lung ca. HOP-62 


4.7 


Cerebral Cortex Pool 


18.3 


T r^n KICK'S IT COO 

JLung ca. iNCi-HDzz 


n 1 
/.i 


Brain (Substantia nigra) 
Pool 


24.0 


Liver 


0.4 


Brain (Thalamus) Pool 


13.7 


Fetal Liver 


1.2 


Brain (whole) 


18.8 


Liver ca. HepG2 


6.0 


Spinal Cord Pool 


3.5 


Kidney Pool 


0.5 


Adrenal Gland 


2.6 


Fetal Kidney 


2.6 


Pituitary gland Pool 


2.3 


Renal ca. 786-0 


10.8 ; 


Salivary Gland 


1.8 


Renal ca. A498 


9.5 


Thyroid (female) 


1.0 


Renal ca. ACHN 


11.0 


Pancreatic ca. CAP AN2 


19.2 


Renal ca. UO-31 


15.9 


Pancreas Pool 


2.5 



Table 56. Panel 1.3D 



Tissue Name 


ReJ. 
Exp.(%) 
Ag2872, 
Run 
161971644 


Rel. 

Exp.(%) 
Ag2872, 
Run 
165721686 


Rel. 
Exp.(%) 
Ag2872, 
Run 
166006455 


Tissue Name 


Rel. 
Exp.(%) 
Ag2872, 
Run 
161971644 ; 


Rel. 
Exp.(%) 
Ag2872, 
Run 
165721686 


Rel. 
Exp.(%) 
Ag2872, 
Run 
166006455 


Liver 

adenocarcinoma ; 


16.7 


22.5 


21.8 


Kidney 
(fetal) 


1.9 


2.9 


1.1 


Pancreas 


0.3 


4.3 


5.4 


Renal ca. 
786-0 


2.8 ! 


17.6 i 


15.0 


Pancreatic ca. 
CAP AN 2 


5.5 


45.4 


44.8 


Renal ca. 
A498 




36.3 


12.7 


Adrenal gland 


0.5 


2.2 


0.3 


Renal ca. ■ 
RXF 393 




85.9 ! 


77.9 
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Thyroid 


0.9 


J.l (| | 1.6 


Renal ca. 
ACHN 


6.4 


17.8 


17.8 


Salivary gland 


0.5 




5.4 


Renal ca. 
UO-31 : 


5.0 


,27.0 . 


19.3 


Pituitary gland 


1.7 


11.4 


4.3 


Renal ca. 
XK-10 


2.3 


7.9 


10.0 


Brain (fetal) 


2.9 




OO 1 

ZZ. l 


Liver 


0.2 


0.0 


1.0 


Brain (whole) 


6.1 


66.4 


79.6 


Liver (fetal) 


0.4 


0.8 


0.7 


Brain (amygdala) 


6.4 


49.7 
i 


41.8 


Liver ca. 

(hepatoblast) 

HepG2 


4.3 , 


14.7 


••18.6 


Brain 

(cerebellum) , 


13.1 


82.9 


100.0 


Lung 


0.5 


6.0 


9.0 


Brain ! 
(hippocampus) 


10.2 


32.8 


29.3 


Lung (fetal) 


0.5 


4.3 


0.7 


Brain (substantia 
nigra) 


2.0 


14.5 


12.2 


Lung ca. 
(small cell) 
LX-1 


3.4 
i 


16.0 


11.6 


Brain (thalamus) j 


7.4 


42.6 


63.3 


Lung ca. 
(small cell) 
NC1-H69 


3.6 I 

t - 


26.1 


26.4 


Cerebral Cortex 


100.0 


84.1 


90.1 


Lung ca. 
(s.cell var.) : 
SHP-77 


11.1 


25.0 


13.8 


Spinal cord 


0.9 


3.1 


3.6 


Lung ca. 
(large 

cell)NCI- i 
H460 


0.3 


4.3 


43.2 


glio/astro U87- 
MG 


13.2 


14.5 


15.7 


Lung ca. j 
(non-sm. cell)' 
A549 


3.9 


8.0 


9.9 


glio/astro U-118- 
MG i 


0.0 


0.0 


0.0 


Lung ca. j 
(non-sxell) j 
NCI-H23 1 j 


3.0 


6.8 


3.6 


astrocytoma 
SW1783 


7.3 


7.9 


10.3 


Lung ca. J 
(non-s.cell) ' 
HOP-62 1 


2.0 


8.7 


6.4 


neuro*; met SK- 
N-AS ' 


1.2 


5.9 


2,2 


Lung ca. ! 
(non-s.cl) 
NCI-H522 ; 


2,6 ■ 


5.0 


3.1 


astrocytoma SF- < 
539 ' 


1.0 


3.7 


6.0 


Lung ca. 
(squam.) SW 
900 


15.6 


58.2 


94.0 


astrocytoma SNB- 
75 


9.7 


100.0 


21.3 


Lung ca. 
(squam.) 
NCI-H596 


2.5 


16.5 


19.3 


glioma SNB-19 ; 


2.5 


6.4 


6.7 


Mammary 
gland 


0.7 1 


6.2 ! 


2.0 


glioma U251 


2.8 


22.2 


8.9 


Breast ca.* 
(pl.ef)MCF- ! 
7 


11.3 


21.9 


16.7 


glioma SF-295 


1.2 


3.7 


6.3 


Breast ca.* 
(pl.ef) MDA- • 
MB-231 


4.1 ; 


32.3 ! 


6.7 


Heart (fetal) ; 


0.5 


0.6 


1.7 


Breast ca.* | 


3.5 i 


17.0 ; 


14.4 j 



284 



BNSDOCID: <WO_02055704A2J_> 



WO 02/055704 



FCT/US02/00554 











fnl ef> T47D 








Heart 


0.1 1 


1.1 


0.0 


Breast ca. 
BT-549 


0.3 




0.8 


Skeletal muscle | 
(tetai) , ; 


0.8 


t 

0.7 


0.3, , 


Breast ca. 1 


1.0 


'o.9 ! 


3.3 


Skeletal muscle 


1.0 


4.4 


2.5 


Ovary 


2.3 1 


1.4 


0.0 


Bone marrow 


0.4 


1.4 


0.9 


Ovarian ca. 
OVCAR-3 j 


3.9 


19.1 


10.2 


Thymus 


2.5 


0.7 


2.3 
i 


Ovarian ca. ' 
OVCAR-4 


1.6 


25.2 " ; 


22.1 


Spleen 


0.7 


• i 1 - 

6.0 


3.3 


Ovarian ca. > 


6.3 


27.7 i 


40.6 


Lymph node 1 


1,4 ' 


8.7 , 


3.2 


Ovarian ca. 
OVCAR-8 


3.3 


14.0 i 


6.2 


Colorectal , 


7.5 


6.7 


6.S 


Ovarian ca. 
IGROV-1 


1.4 


4.6 


11.2 


Stomach 


0.9 


8.3 


2.8 


Ovarian ca.* 
(ascites) SK- 
OV-3 i 


6.4 


46.7 


39.2 


Small intestine ! 


0.8 


H, - 


2.1 


Uterus i 


0.1 


1.1 


0.0 


Colon ca. SW480 


2.5 


6.4 


18.9 


Placenta 


.2.2 


6.ji | 


9.7 


Colon ca.* 
SW620(SW480 ! 
met) 


3.1 


13.8 


9.9 


Prostate 


0.5 


2.6 


2.8 


Colon ca. HT29 


3.2 


3.1 


4.2 


Prostate ca.* 
(bone \ 
met)PC-3 


4.4 


39.2 


16.6 


Colon ca. HCT- ' 
116 


2.9 


15.0 


4.8 


Testis 


2.9 


7.3 


3.7 


fYiInn rn fafn-? 

V^UlUil I'd. VyflVU 


9.0 


11.7 


9.9 


Melanoma 
Hs688(A).T 


0.1 


0.0 


0.0 


Colon ca. 
tiq<nie(OD03 8 66^ 

11 jO Uvl Vi/ V/J yj \J\J J 


4.0 


6.9 


4.8 


Melanoma* 
(met) 

Hs688(B).T 


0.0 


0,0 


0.0 


Colon ca. HCC- 
299S 


3.1 


12.6 


11.1 


Melanoma 
UACC-62 


1.6 


6.3 


9.3 


Gastric ca.* (liver 
met) NCI-N87 


11.7 


54.3 


23.2 


Melanoma 
M14 


0.9 


11.9 


2.1 


Bladder ; 


7.3 


14.7 


6.6 


Melanoma 
LOX IMV1 


0.4 


0.0 


0.4 


Trachea 


2.8 


5.9 


2.0 


Melanoma* 
(met) SK- 
MEL-5 


2.1 


8.4 


6.0 


Kidney 


2.0 


5.8 


0.4 


Adipose 


0.7 


2.4 


2.4 



Table 57. Panel 2.2 



Tissue Name 


Rel. Exp.(%) Ag2872, 
Run 175149214 


Tissue Name 


Rel. Exp.(%) Ag2872, 
Run 175349214 


Normal Colon 


17.4 


Kidney Margin (OD04348) 


100.0 


Colon cancer (OD06064) 


26.6 


Kidney malignant cancer 
(OD06204B) 


14.1 


Colon Margin (OD06064) 


16.2 


Kidney normal adjacent 
tissue (OD06204E) 


8.5 
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Colon cancer (OD06159) 


4.4 


Kidney Cancer (OD04450- 
01) ' 


29.1 

— , . L 


Colon Margin (OD06159) 


11.5 • 


Kidney Margin (OD04450- . 
03) ' 


21.3 


Colon cancer (OD06297-04) 


1.1 


Kidney Cancer 8120613 


2.3 


Colon Margin (OD06297- 
015) 


19.6 


Kidney Margin S120614 


16.7 


CC Gr.2 ascend colon 
(OD03921) 


A A 


jviuiicy cancer 7Uiuj/u 


0.0 


CC Margin (OD03921) 


3.5 


Kidney Margin 9010321 


10.4 


Colon cancer metastasis 
(OD06104) 


0.0 


Kidney Cancer 8 1 20607 


16.3 


Lung Margin (OD06104) 


1 6.6 


Kidney Margin 8120608 


7.5 


Colon mets to lung 
(OD04451-01) 


33.0 


Normal Uterus 


0.0 


Lung Margin (OD0445 1 -02) 


10.8 


Uterine Cancer 06401 1 


2.2 


Normal Prostate 


7.3 


Normal Thyroid 


0.0 


Prostate Cancer (OD04410) 


8.0 


Thyroid Cancer 064010 


5.8 


Prostate Margin (OD04410) 


5.8 


Thyroid Cancer A302152 


20.0 


Normal Ovarv 


2.4 


Thyroid Margin A3D2153 


3.4 


Ovarian cancer (OD06283^ 
03) 


4.2 


Normal Breast 


21.6 


Ovarian Margin (OD06283- 
07^ 


8.2 


Breast Cancer (OD04566) 


8.3 


Ovarinn Panrer 

KJ V al lull Vv u li\^\^± \J \J t uuu 


12.2 


Breast Cancer 1 024 


22.5 


Ovarian cancer (OD06 145) 


4.3 


Breast Cancer (OD04590- 
01) 


41.2 


Ovarian Margin (OD06145) \ 


8.7 


Breast Cancer Mets 
(OD04590-03) 


26.1 


03) 


20.2 


Breast Cancer Metastasis 
(OD04655-05) 


46.0 


Ovarian Margin (OD06455- ! 
07) 


0.0 


Breast Cancer 064006 


15.8 


Normal Lung 


7.6 


Breast Cancer 91 00266 


9.6 


Invasive poor diff. lung 
adeno (ODO4945-01 


/ /.J 


jt>reast iviargm y iuuzoj 


1 7 


Lung Margin (ODO4945-03) 


14.1 


Breast Cancer A209073 


9.9 


Lung Malignant Cancer 
(OD03126) 


16.0 


Breast Margin A2 090734 


17.2 


Lung Margin (OD03 126) 


4.3 


Breast cancer (OD06083) 


50.3 


Lung Cancer (OD05014A) 


7.5 


Breast cancer node 
metastasis (OD06083) 


42.0 


Lung Margin {UVvDviqii) 


1 O.J. 


INUIiJldi JL/iVCI 




Lung cancer (OD06081) 


23.8 ; 


Liver Cancer 1026 


3.2 


Lung Margin (OD06081) 


12.3 


Liver Cancer 1 025 


10.4 


Lung Cancer (OD04237-01) 


9.9 


Liver Cancer 6004-T 


0.3 


Lung Margin (OD04237-02) 


25.5 


Liver Tissue 6004-N 


6.8 


Ocular Melanoma Metastasis ; 


4.3 


Liver Cancer 6005-T 


13.6 


Ocular Melanoma Margin 
(Liver) 


3.7 


Liver Tissue 6005-N 


8.8 


Melanoma Metastasis 


5.8 


Liver Cancer 064003 


21.2 
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Melanoma Margin (Lung) 


15.1 


Normal Bladder 


17.4 • 


Normal Kidney i 


"1.8 


Bladder Cancer 1023 1 


• ' 4.6 


Kidney Ca, Nuclear grade 2 
(OD04338) 


, 41.8 


Bladder Cancer A302173 


r 22.4 


Kidney Margin (OD04338) 


10.4 


Normal Stomach 


27.7 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


52.1 


Gastric Cancer 9060397 


3.6 


Kidney Margin (OD04339) 


15.3 


Stomach Margin 9060396 


14.2 


Kidney Ca, Clear cell type 
(OD04340) 


0.0 


Gastric- Cancer 9060395 


'5.1,, 


Kidney Margin (OD04340) 


16.8 


Stomach Margin 9060394 


10.2 


Kidney Ca, Nuclear grade 3 
(OD04348) 


0.0 


Gastric Cancer 064005 


12.1 



Table 58 . Panel 2D 



Tissue Name 


JRel. Exp.(%) Ag2872, 


Tissue Name 


Rel. Exp.(%) Ag.2872, 
Run 161971795 


WAnvin 1 C 1 mi 

iNLJIiluii V_/UJUli 


30.8 


Kidnev Margin 81 206>08 


2.9 


CC Well to Mnd Diff 

(OD03866) 


, ,13.0 


Kidney Cancer 8120613 


7.6 


CC Margin (OD03 8 66) 


5.9 


Kidney Margin 81^0614 


15.3 


(OD03868) 


16.2 


Kidney Cancer 9010320 


10.9 


CC Margin (OD03868) 


3.4 


Kidney Margin 9010321 


23.0 




16.3 


TvJr»rmnl 1 Ttpni*; 


0.0 




10.9 


Uterus Cancer 06401 1 


23.5 


CC Gr.2 ascend colon 
(OD03921) 


20.4 


Normal Thyroid 


4.0 


CC Margin (0DO3921) 


4.8 


Thyroid Cancer 064010 


14.5 


CC from Partial Hepatectomy 
(ODO4309) Mets 


21.6 


Thyroid Cancer A3 02 152 


15.0 


Liver Margin (ODO4309) 


6.2 


Thyroid Margin A302153 


11.7 


Colon mets to lung (OD04451- 
01) 


24.0' 


Normal Breast 


21.3 


Lung Margin (OD0445 1 -02) 


9.5 


Breast Cancer (OD04566) 


25.9 


Normal Prostate 6546-1 


1.8 ' 


Breast Cancer (OD04590- 
01) 


42.3 


Prostate Cancer (OD044 1 0) 


19.1 


Breast Cancer Mets 
(OD04590-03) 


39.2 


Prostate Margin (OD04410) 


15.2 


Breast Cancer Metastasis 
(OD04655-05) 


40.9 


Prostate Cancer (OD04720-01) 


17.2 


Breast Cancer 064006 


15.9 


Prostate Margin (OD04720-02) 


18.3 


Breast Cancer 1 024 


24.8 


Normal Lung 061010 


31.0 


Breast Cancer 9100266 


23.8 


Lung Met to Muscle 
(OD04286) 


12.1 


Breast Margin 9100265 


7,9 


Muscle Margin (OD04286) 


4.6 


Breast Cancer A209073 


23.2 


Lung Malignant Cancer 
(OD03126) 


35.4 


Breast Margin A2090734 


17.2 


Lung Margin (OD03126) 


24.8 


Normal Liver 


4.2 


Lung Cancer (OD04404) 


43.2 


Liver Cancer 064003 


10.1 
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Lung Margin (OD04404) 


14.2 


Liver Cancer 1025 


3.3 


Lung Cancer (OD04565) 


26.6 


Liver Cancer 1026 


5.3 


Lung Margin (OV04565) 


8J 


Liver Cancer 6004-T 


4.6 


Lung Canqer (OD04237-01) 


25.2 


Liver Tissue 6004-N 


8.5 


Lung Margin (OD04237-02) 


16.6 


Liver Cancer 6005-T 


5.6 


Ocular Mel Met to Liver 
(ODO4310) 


7 o 


Liver i issue ouuj-in 


1 .0 


Liver Margin (ODO43I0) 


9.2 


Normal Bladder 


28.7 


Melanoma Mets to Lung 
(OD04321) 


0 A 


JtsiauGer cancer 


1 <i 1 

10.1 


Lung Margin (OD04321) 


32.1 


Bladder Cancer A302173 


27.4 


Normal Kidney 


1 32.5 


Bladder Cancer 
(OD04718-01) 


32.1 


Kidpey Ca, Nuclear grade 2 
(OD04338) 


JU.O 


Bladder Normal Adjacent 
(OD047 18-03) 


i 

A A 


Kidney Margin (OD04338) 


32.8 


Normal Ovary 


2.9 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


31.0 


uvarian cancer uo^uuo 


1 7 1 




23.3 


Ovarian Cancer 
(OD04768-07) 


100.0 


Kidney Ca, Clear cell type 
(OD04340) 


5.5 


Ovary Margin (OD04768- 
08) 


1.5 


Kidney Margin (UL/U4.54U) 


oo ^ 


Noi*mal Stomach 


1 S R 


Kidney Ca, Nuclear grade 3 
(OD04348) 


0.5 


Gastric Cancer 9060358 


2.5 


Kidney Margin (OD04348) 


27.9 


Stomach Margin 9060359 


7.5 


Kidney Cancer (OD04622-01) 


6.0 


Gastric Cancer 9060395 


8.8 


Kidney Margin (OD04 622-03) 


4.5 j 


Stomach Margin 9060394 


14.4 


Kidney Cancer (OD04450-01) 


16.5 


Gastric Cancer 9060397 


33.0 


Kidney Margin (OD04450-03) 


19.2 


Stomach Margin 9060396 


7.9 


Kidney Cancer 8120607 


11.5 


Gastric Cancer 064005 


23.0 



Table 59 . Panel 3D 



Tissue Name 


Re). Exp.(%) 
Ag2872, Run 
164543502 


Rel. Exp.(%) 
Ag2872, Run 
164828587 


Tissue Name 


Rel. Exp.(%) 
Ag2872 5 Run 
164543502 


Rel. Exp.(%) 
Ag2872, Run 
164828587 


Daoy- 

Medulloblastoma 


2.5 


1.9 


Ca Ski- Cervical 
epidermoid carcinoma 
(metastasis) 


8.2 


9.7 


TE671- 

Medulloblastoma 


1.5 


2.0 


ES-2- Ovarian clear cell 
carcinoma 


0.6 


0.5 


D283 Med- 
Medulloblastoma 


6.3 


8.8 


Ramos- Stimulated with 
PMA/ionomycin 6h 


3.0 


3.3 


PFSK-1- Primitive 
Neuroectodermal >- 


13 


1.6 


Ramos- Stimulated with 
PMA/ionomycin 14h 


3.0 


3.8 


XF-498- CNS 


0.3 


0.4 


MEG-01- Chronic 
myelogenous leukemia 
(megokaryoblast) 


0.4 


0.8 


SNB-78- Glioma 


0.0 


0.0 


Raji- Burkitt's lymphoma 


1.2 


1.1 


SF-268- Glioblastoma: 


0.7 


0.9 


Daudi- Burkitt's 
lymphoma 


2.2 | 


2.4 


T98G- Glioblastoma 


0.7 


1.2 


U266- B-cell 


1.5 


1.1 
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plasmacytoma 






SK-N-SH- 

in cm u DJds>ionia 

(metastasis) 


1 0 


0 ft 


CA46- Burkitfs 
lymphoma 


1 A 


ft Q 


SF-295- Glioblastoma 


0.4 


0.6 


JvJL- non-iioagKm s Jts-cejj 
lymphoma 


0.7 


0.9 


Cerebellum 


7.0 


10.4 


Jivi i - pre-r>-cen 
lymphoma 


1.2 


1.7 




7 Q 


n i 
1Z. J 


Jurkat- T cell leukemia 


1 i 


1 o 


NCI-H292- 
Mucoepidermoid lung 
carcinoma 


1ft Q 


z-O 


TF- 1 - trythroleukemia 


ft 0 

U.z 


0.2 


T^A /TO 11/1 C~->~11 ,-.^11 

jJMiS-i 14- bmall cell 
lung cancer 


1.6 


1 1.7 


"LJl TT TO T* 11 

HU 1 78- T-cell 
lymphoma 


0.9 


1.6 


u?vip-/y- Small cell 
lung cancer 


100.0 


100.0 


U937- Histiocytic 
lymphoma 


0.6 


1 

1.4 


NCI-H146- Small cell 
lung cancer 


8.5 


8.9 


KU-812- Myelogenous 
leukemia 


0.1 


0.2 


NCI-H526- Small cell 
lung cancer 


9.7 


13.6 


769-P- Clear cell renal 
carcinoma 


1.7 


2.2 


NCI-N417- Small cell 
lung cancer 


2.5 


2.9 


Caki-2- Clear cell renal 
carcinoma 


1.8 


2.6 


NCLH82- Small cell 
lung cancer 


1.5 


1.7 




SW 839- Clear cell renal . 
carcinoma | 


2.1 


2.2 


NCI-H157- 
Squamous cell lung 
cancer (metastasis) 


7.4 


10.1 


G401- Wilms* tumor 


0.S 


1.6 


NCI-HI 155- Large 
cell lung cancer 


8.7 


10.8 


Hs766T- Pancreatic 
carcinoma (LN 
metastasis) 


2.8 j 


3.1 


NCI-H 1299- Large 
cell lung cancer 


5.1 


5.1 


CAPAN-1- Pancreatic 
adenocarcinoma (liver 
metastasis) 


3.3 


3.4 


NCI-H727- Lung 
carcinoid 


6.2 


0. / 


SU86.86- Pancreatic 
carcinoma (liver 
metastasis) 


5.7 


7.4 


NCLUMC-ll-Lung 
carcinoid 


17.8 


15.4 


BxPC-3- Pancreatic 
adenocarcinoma 


7.1 


10.0 


LX-1- Small cell lung 
cancer 


5.1 


4.3 


HPAC- Pancreatic 
adenocarcinoma 


4.4 


3.2 


Colo-205- Colon 
cancer 


3.5 


4.5 


MIA PaCa-2- Pancreatic 
carcinoma 


0.8 


1.6 


KM 12- Colon cancer 


5.2 


6.0 


CFPAC-1- Pancreatic 
ductal adenocarcinoma 


30.4 


28.5 


KM20L2- Colon 
cancer 


1.3 


0.9 


PANC-1- Pann patir 

epithelioid ductal 
carcinoma 


3.8 


3.7 


NCI-H716- Colon 
cancer 


10.8 


13.9 


T24- Bladder carcinma 
(transitional cell) 


2.5 


2.4 


SW-48- Colon 
adenocarcinoma 


1.6 


1.3 


5637- Bladder carcinoma 


5.7 


6.0 


SW111 6- Colon 
adenocarcinoma 


2.9 


2.9 


HT-1197- Bladder 
carcinoma 


6.0 ; 


6.3 


LS 174T- Colon 


6.2 


6.9 


UM-UC-3- Bladder 


0.8 


0.6 
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<HJCIlUt>«I OII1UI1 Jd 


i » 


1 1 ; 


carcinma (transitional 
cell) 






SW-948- Colon 

a H p n n c a r r i n run a 


O.y 


0.6 

1 


A204- 

Rhabdomyosarcoma 


0.9 

, r 


L2 


SW-480- Colon , 

^ f*n rn~ a rr i ri n ma 


2.2 


0.2 


HT-1080- Fibrosarcoma 


8.2 


11.8 


NCI-SNU-5- Gastric 


2.8 


3.4 


MG-63- Osteosarcoma 


0.0 


0.0 


KATO III- Gastiic 
carcinoma 


3.3 


6.3 


SK-LMS-1- • 
Leiomyosarcoma (vulva) 


1.7 


1.8 


NCI-SNU-16- Gastric 
carcinoma 


3.9 


j 

3.8 


O TO TJOA 

Rhabdomyosarcoma (met 
to bone marrow) 


1.4 


0.7 


NC1-SNU-1- Gastric ! 
carcinoma 


6.0 


1 8.0 


A431- Epidermoid 

uui 111 \Jlxii.\ 


4.1 I 


5.6 


RF-1- Gastric , ' 
adenocarcinoma 


1.6 


2.3 .; 


WM266-4- Melanoma 


2.0 


2.1 


RF-4 8- Gastric 
adenocarcinoma 




1 k 

1.0 


DU 145- Prostate 

ta^ CJJLiUllJicl ^UlollI 

metastasis) 


0.3 


0.1 


MT<n\I-4S- fiastric 

carcinoma 


7.0 j 


7.6 


MDA-MB-468- Breast 
adenocarcinoma 


4.6 , 


7.5 


NCI-N87- Gastric ' 
carcinoma 


4.4 


4.8 ( ; 


SCC-4- Squamous cell , 
carcinoma of tongue 


0.8 


0.4 


OVCAR-5- Ovarian 
carcinoma 


0.9 


1.4 


SCC-9- Squamous cell 
carcinoma of tongue 


0.7 


0.5 . 


RL95-2- Uterine 
carcinoma 


2.3 


3.0 


SCC-15- Squamous cell 
carcinoma of tongue 


0.4 


0.3 


HelaS3- Cervical 
adenocarcinoma 


1.4 


2.2 


GAL 27- Squamous cell 
carcinoma of tongue 


3.1 


3.4 



Table 60 . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) Ag4847, 
Run 223335762 


Tissue Name 


Rel. Exp.(%) Ag4847, 
Run 223335762 


Secondary Thl act 


12.2 


HUVEC IL-lbeta 


0.0 


Secondary Tli2 act 


14.8 


HUVEC IFN gamma j 


0.0 


Secondary Trl act 


9.5 


HUVEC TNT alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


2.6 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


8.7 


HUVEC IL-11 


0.0 


Secondary Trl rest 


9.9 


Lung Microvascular EC none 


0.0 


Primary Thl act 


2.1 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


14.2 


Microvascular Dermal EC none 


0.0 


Primary Trl act 


2.8 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


2.7 


Bronchial epithelium TNFalpha j 
+ ILlbeta 


51.8 


Primary Th2 rest 


3.6 


Small airway epithelium none 


28.7 


Primary Trl rest 


19.1 


Small airway epithelium } 
TNFalpha + IL-lbeta 


52.5 


CD45RA CD4 lymphocyte 
act 


5.3 


Coronery artery SMC rest 


0.0 
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CD45RO CD4 lymphocyte 
act 




Coronery artery SMC TNFalpha 
+ iL-lbeta 


0.1 


CD8 lymphocyte act 


3.9 


Astrocytes rest 


16.7 


Secondary CD8 
lymphocyte rest ' 


o.o 




9.7 


Secondary CDS 
lymphocyte act 


2.4 


KU-8 12 (Basophil) rest 


0.3 


CD4 lymphocyte none 


0.5 


KU-8 12 (Basophil) 
PMAyionomycin 


, 0.2 


2ry Thl/Th2/Trl„anti- 
CD95 CH11 


7.0 


CCD 1 106 (Keratinocytes) none 


68.8 ' : 


LAK cells rest 

i 


1.9 


CCD1 106 (Keratinocytes) 

1 iNPaipild ' 1L>- I UCLd 


29.5 


LAK cells 1L-2 


— — — 1 1 

O.J 




4.1 


LAK cells IL-z+lL-l/ 




MPT ViOQO nc\r\e> 
iNv^i-nzyz nunc 


27.7 


LAK cells IL-2+IFN 
gamma 


4.0 ] 


NCI-H292 IL-4 


87.1 


T A V ^ ^ 1 1 „ T T TT 1C 

LAK cells LL-2+ JLL-16 


D.J 


MPT T49Q? TT Q 


81.2 


LAK cells 

PA/I A /innfimvfMn 


3.? • j 


NCI-H292 IL-13 \ 


74.7 


NK Cells IL-2 rest 


5.1 


NC1-H292 1FN gamma 


1 35.4 


Two Way MLR 3 day 


1.6 


HPAEC none 


0.0 


Two Way MLR 5 day 


1.3 ' i 


HP AEC TNFalpha + IL-1 beta 


0.0 


Two Way MLR 7 day 


1.0 


Lung fibroblast none 


0 d 


PBMC rest 


3.8 


Lung fibroblast TNF alpha + IL-. 
1 beta j 


0.5 


PBMC PWM 


5.6 i 


Lung fibroblast IL-4 


0.0 


PBMC PHA-L 


13.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) none 


15.1 


Lung fibroblast IL-13 


0.0 


Ramos (B cell) ionomycin 


15.5 


Lung fibroblast IFN gamma 


0.0 


B lymphocytes PWM 


9.2 


Dermal fibroblast CCD 3 070 rest 


0.0 


B lymphocytes CD40L 
and IL-4 


42.0 


Dermal fibroblast CCD 1070 
Lrir aipna 


6.2 


EOL-1 dbcAMP 


0.0 


i JCI nidi llDIUOidbl ^\_-JL>lUfvy i.\~t 

1 beta 


0.0 


EOL-1 dbcAMP 
PMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IL-4 


0.0 


Dendritic cells LPS 


0.0 


Dermal Fibroblasts rest 


0.0 


Dendritic cells anti-CD40 


0.0 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


0.0 


Neutrophils rest 


0.0 


Monocytes LPS 


0.0 


Colon 


0.7 


Macrophages rest 


0.0 


Lung 


8.8 


Macrophages LPS 


0.0 


Thymus 


32.5 


HUVEC none 


0.0 


Kidney i 


100,0 


HUVEC starved 


0.0 






Table 61. Panel 4D 


Tissue Name 


Re). Exp.(%) Ag2872, 
Run 159776802 


Tissue Name 


Reh Exp.(%) Ag2872, 
Run 159776802 


Secondary Thl act 


6.7 


HUVEC IL-lbeta 


0.0 
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secondary J nz act 


10.1 


|JriUVbC if JN gamma 


U.U 


Secondary Trl act 


10.7 


HUVEC TNF alpha + IFN 
gamma 


0.0 


Secondary Thl rest 


2.1 


HUVEC TNF alpha + IL4 


0.0 


Secondary "Th2 rest 


4.2 


HUVEC IL-11 


0.0 


Secondary Trl rest 


4.4 


Lung Microvascular EC none 


0.3 


Primarv Thl act 

x iiiiiai y till uvt 


6.2 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


0.0 


Primary Th2 act 


6.0 


Microvascular Dermal EC none 


0.1 


i j uijdi y ill «vi 


5.7 


Microsvasular Dermal EC 
TNFalpha -ML- 1 beta 


0.0 


Primarv Thl rf*<:t 
x i xxiiai y 1111 lvoL 




Bronchial epithelium TNFalpha 
+ ILlbeta 


2.7 


primary Th2 rest 


20.9 


Small airway epithelium none 


18.9 , 


Primary Trl rest 


18.9 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 lymphocyte 
act 


6.2 


Coronery artery SMC rest 


0.1 


CD45RO CD4 lymphocyte 


11.0 


Coronery artery SMC TNFalpha 


0.0 


act 


+ IL-lbeta ; 


CDS lymphocyte act 


13.4 


Astrocytes rest 


17.9 


Secondary CD 8 
lymphocyte rest 


10.7 


Astrocytes TNFalpha + IL-lbeta; 


16.0 


Secondary CDS 
lymphocyte act 




VII %M fRa«:fmhiTl rfxiA ! 


0.3 


CD4 lymphocyte none 


z.o 


KU-812 (Basophil) 
PMA/ionomycin 


1.2 


2ryThl/Th2/Trl anti- 
CD95 CH11 


8.5 


CCD 1 106 (Keratinocytes) none ; 


29.5 


LAK cells rest 


5.0 


CCD1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta 


3.4 


LAK cells IL-2 


10.7 


Liver cirrhosis \ 


4.2 


LAK cells IL-2+IL- 1 2 


8.4 


Lupus kidney 


5.2 


LAK cells IL-2+IFN 

£ CtlliiJiU 


10.3 


NCI-H292 none j 


55.1 


LAK cells IL-2+ IL-18 


8.0 


NCI-H292 IL-4 


77.4 


LAK cells 

r ivjl/v JUiiuiiiy^JUii 


4.5 


NCI-H292 IL-9 


70.2 


NT<r rpl1<; TT -9 rest 


5.7 


NCI-H29'? IT -13 


37.1 


i wu vv ay ivxi-jj\. j utiy 


6.0 


NCI-H792 IFN gamma 


27.4 


Twn Wav MT R 5 dav 
i W\J vv ay ivii_/iv uay 


4.4 


HPAEC none 


0.1 


Two Way MLR 7 day 


5.9 


HPAEC TNF alpha + IL-1 beta 


0.0 


1 JdMU rest 




j^uiig iiurouiasi noiie 




PBMC PWM 


26.4 


Lung fibroblast TNF alpha + IL- 
1 beta 


1.4 


PBMC PHA-L 


19.2 


Lung fibroblast IL-4 


1.2 


Ramos (B cell) none 


11.7 1 


Lung fibroblast IL-9 


0.8 


Ramos (B cell) ionomycin 


56.3 


Lung fibroblast IL-13 


0.4 


B lymphocytes PWM 


31.6 


Lung fibroblast IFN gamma 


0.7 


B lymphocytes CD40L 
and IL-4 


22.5 


Dermal fibroblast CCD1070 rest 1 


1.7 
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EOL-1 dbcAMP 


0.8 


Dermal fibroblast CCD 1 070 
TNF alpha 


i.y 


EOL-1 dbcAMP 
PMA/ionomycin 


0.6 


Dermal fibroblast CCD 1070 IL- , 
1 beta ■ 1 ■ 


0.4 


Dendritic cells none 


0.7 


Dermal fibroblast IFN gamma 


0.3 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 


2.0 


Dendritic cells anti-CD40 


0.1 


IBD Colitis 2 


0.8 


Monocytes rest 


03 


IBD Crohn's 


1.6 


Monocytes LPS 


0.4 


Colon 


10.4 


Macrophages rest 


1.0 


Lung i 1 


6.8 


Macrophages LPS 


0.8 


Thymus 


21.2 


HUVEC none 


0.0 


Kidney 


32.5 


HUVEC starved 


0.3 







Table 62. Panel 5 Islet 



J JsMlt. jL1»1I1JC 


Rel. Exp.(%) 
Ag2872, Run 
237228677 


Tissue Name 


j<ei. ii.xp.( /o ) 
Ag2872, Run 
237228677 


02go_adipose 


5.1 ! 


94709JDonor 2 AM - A^adipose 


1.8 


Q7476 Patient-07sk skeletal 
muscle 


2.7 


94710JDonor 2 AM - B_adipose 


0.0 


07477 PatiVnt-07ut litems 


0.0 


94711 Donor 2 AM'- C adipose 


0.0 


07pl_placenta 


39.5 


94712_Donor 2 AD - A_adipose 


0.0 


001 67 Ravpr Patient 1 


97.9 j 


94713 Donor 2 AD -B adipose 


0.0 


974S2JPatient-08ut_uterus 


0.0 


94714_Donor 2 AD - C_adipose 


0.0 


97483_Patient- 
08pl_placenta 


21.9 


94742_Donor 3 U - AJMesenchymal : 
Stem Cells 


0.0 


97486_Patient-09sk_skeletal ! 
muscle 


2.5 


94743 JDonor 3 U - B_Mesenchymal 
Stem Cells 


0.0 


97487_Patient-09ut_uterus 


0.0 


94730JDonor 3 AM - A_adipose 


0.0 


97488JPatient- 
09pl_placenta 


19.3 


94731JDonor 3 AM - B_adipose 


0.0 


97492 Patient-1 Out uterus 


1.9 


94732 JDonor 3 AM - C_adipose 


0.0 


97493JPatient- 

1 0pl__placenta ^ 


100.0 


94733 _Donor 3 AD - A_adipose 


0.0 


97495 JPatient- 
llgo_adipose 


4.0 


94734 JDonor 3 AD - B_adipose 


0.0 


97496_Patient-l lsk_skeletal 
muscle 


3.2 


94735__Donor 3 AD - C^adipose 


0.0 


97497 Patient-1 lut uterus 


1.7 


77138JLiverJHepG2iintreated 


46.0 


97498JPatient- 
1 lpl_jplacenta 


23.8 


73556_Heart_Cardiac stromal cells 
(primary) 


0.0 


97500_Patient- 
12go_adipose 


1.6 


81735_Small Intestine 


13.5 


97501_Patient-12sk__sketetal 
muscle ; 


4.0 


72409JKjdneyJProximal Convoluted 
Tubule 


17.1 


97502 Patient- 12ut_uterus 


0.0 


82685__Small intestine^Duodenum 


1.4 


97503 JPatient- 
12pl_placenta 


20.3 


90650_Adrenal_Adrenocortical 
adenoma 


1.6 


94721 Donor2U- 


0.0 


72410 Kidney JHRCE 


90.1 
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A_Mesenchymal Stem Cells 








94722_Donor 2 U - 
B_Mesenchymal Stem Cells 


0.0 


72411JCioneyJHRE 


48.6 


94723_Donar2U- 
C_Mesenchymal gtem Cells 


0.0 


73 1 39 JUterus JUterine smooth 
rmtecle cells 


r 

2.9 



Table 63 . Panel CNS 1 



Tissue Name 


ReL Exp.(%) Ag2872, Run 
171669734 


Tissue Name 


Rel. Exp.(%) Ag2872, Run 
17166^734 


BA4 Control 


26.1 


BA17PSP' , 


36.3 


BA4 ContTol2 


50.0 ' 


BA17PSP2 


10.2 


BA4 Alzheimer's2 


9.3 


Sub Nigra Control 


8.8 




21.5 


Sub Nigra Control2 


21.5 


BA4 Parkinson's2 


. 1 100.0 


Sub Nigra Alzheimer's2 


6.8 


BA4 Huntington's 


25.0 


Sub Nigra Parkinson's2 


19.9 


BA4 

Huntington's2 


15.7 


Sub Nigra Huntington's 


36.9 


BA4 PSP 


18.7 


Sul? Nigra j 
Huntington's2 


— t ■■ — 

15.5 


BA4 PSP2 


42.3 1 


Sub Nigra PSP2 


4.7 


BA4 Depression 


13.7 


Sub Nigra Depression 


1 4.1 


BA4 Depression2 


11.4 


Sub Nigra Depression2 « 


6.2 


BA7 Control 


45.1 


Glob Palladus Control 


2.5 


BA7 Control 


38.2 


Glob Palladus Control2 


7.7 


BA7 Alzheimer's2 


9.7 


Glob Palladus 
Alzheimer's 


4.5 


BA7 Parkinson's 


16.5 


Glob Palladus 


4.1 


BA7 Parkinson's2 


43.8 


Glob Palladus 
Parkinson's 


33.2 


BA7 Huntington's 


52.5 


Glob Palladus 
Parkinson's2 


5.1 


BA7 

Huntington's2 


-3D. 3 


oioD Jranaaus tor 


1.0 


BA7 PSP 


40.1 


Glob Palladus PSP2 


3.7 


BA7 PSP2 


27.7 


Glob Palladus 
Depression 


2.4 


BA7 Depression 


4.2 


Temp Pole Control 


16.6 


BA9 Control 


28.7 


Temp Pole Control2 


47.0 


BA9 Control2 


71.2 


Temp Pole Alzheimer's 


8.1 


BA9 Alzheimer's 


8.1 


Temp Pole Alzheimer's2 


8.3 


BA9 Alzheimer's2 


18.8 


Temp Pole Parkinson's 


27.0 


BA9 Parkinson's 


. 283 


Temp Pole Parkinson's2 ; 


28.9 


BA9 Parkinson's2 


60.3 


Temp Pole Huntington's ' 


36.6 


BA9 Huntington's 


40.9 


Temp Pole PSP 


6.8 


BA9 

Huntington's2 


19.5 


Temp Pole PSP2 


8.1 


BA9 PSP 


21.5 


Temp Pole Depression2 


9.2 


BA9 PSP2 


6.0 


Cing Gyr Control 


49.7 


BA9 Depression 


7.7 


Cing Gyr Control2 


32.1 


BA9 Depression2 


15.5 


Cing Gyr Alzheimer's 


11.0 
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BA17 Control 


38.2 


Cing Gyr Alzheimer's2 


10.1 


BA17 Control2 


71.2 


Cing Gyr Parkinson's 


13.5 


BA17 

Alzheimer's 2 


10.2 


Cing Gyr 'Parkinson^ ' 


20.7 


BA17 Parkinson's 


29.1 


Cing Gyr Huntington's 


41.8 


BA17 

Parkinson's2 


51.4 


Cing Gyr Huntington's2 


8.6 


BA17 ; 
Huntington's 


37.6 


Cing Gyr PSP 


9.7 . 


BA17 

Hnnrington's2 


18.6 


Cing Gyr PSP2 


6,3 


BA17 Depression 


5.6 ! 


Cing Gyr Depression 


10.3 


BA17 Depression2 




Cing Gyr Depression2 


5.9 



1 , CNS_ncurodegeneration^vl.O Summary: Ag2872 This panel confirms the 1 
expression of the CG56071-01 gene at moderate level in the CNS in an independent group of 
, patients. However, no differential expression was found between Alzheimer's disease and 
control postmortem brains in this experiment. Please see Panel 1 .5 for a discussion of the 



5 potential utility of this gene in the central nervous' system. 

General_screerjing_jpanel_vl .5 Summary: Ag4847 Expression of the CG56071-01 
gene is highest in the cerebellum (CT = 25.4). Thus, the expression of this gene could be used 
to distinguish cerebellar brain tissue from other samples in the panel. This gene is also 
expressed at more moderate levels in other central nervous system tissues, including 

1 0 amygdala, hippocampus, cerebral cortex, substantia nigra, thalamus and spinal cord (CTs = 
27-30). This gene encodes a protein with homology to mixed lineage kinase 2. Mixed lineage 
kinase 2 is a mammalian protein kinase that activates stress-activated protein kinases/c-jun N- 
terminal kinases (SAPK/JNKs) through direct phosphorylation of their upstream activator, 
SEK1/JNKK. MAP kinase signaling pathways are important mediators of cellular responses to 

15 a wide variety of stimuli. Signals pass along these pathways via kinase cascades in which three 
protein kinases are sequentially phosphorylated and activated, initiating a range of cellular 
programs including cellular proliferation, endocrine, immune and inflammatory responses, and 
apoptosis. Furthermore, mixed lineage kinases have been implicated in neuronal apoptosis 
(ref. 1). Therefore, therapeutic downregulati on/antagonism of this gene may slow neuronal 

20 apoptosis in diseases such as Alzheimer's, Huntington's and Parkinson's diseases. 

This gene also shows substantial expression in cell lines derived from ovarian cancers 
when compared to normal ovary. Thus, therapeutic modulation of this gene or its protein 
product, through the use of small molecule drugs, antibodies or protein therapeutics, might be 
of benefit in the treatment of ovarian cancer. 
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In addition, this gene is expressed at low to moderate levels in endocrine and metabolic 
tissues including adipose', adrenal gland, liver, pancreas, pituitary gland, skeletal muscle and 
thyroid. Thus, therapeuticmodulktion of this gene or its protein product may be beneficial in 
the treatment of endocrine/metabolic-related disorders, such- as obesity and diabetes. 
5 Interestingly, this gene is expressed at much higher levels in adult skeletal muscle (CT — 30.5) 
than in fetal skeletal muscle (CT = 35), suggesting that expression of this gene may be used to 
differentiate adult from fetal skeletal muscle (Xu Z, Maroney AC, Dobrzanski P, Kukekov, 
NV, Greene LA.The MLK family mediates c-Jun N-tenninal kinase activation in neuronal 
apoptosis. Mol Cell Biol 2001 Jul;2,l(14):4713-24), 

10 Panel 1.3D Summary: Ag2872 Expression of the CG56071-01 gene was assessed in 

three independent experiments using the same probe/primer pair. Two of the three runs had 
good concordance; the third experiment was performed using a different machine and may 
explain the observed differences irj expression. Overall this gene shows highest expression in 
samples derived from brain tissue, either normal tissue or cell lines derived from malignant 

15 brain tissue. Please see panel Panel 1.5 for ,a discussion of utility it) the central nervous system. 

In addition, there is substantial expression of this gene in a number of cancer cell lines, 
including ovarian cancer, breast cancer and renal cancer cell lines. Thus, the expression of this 
gene could be used to distinguish these samples from* the other samples on this panel. 
Moreover, therapeutic modulation of the activity of this gene or its protein product, through 

20 the use of small molecule drugs, antibodies or protein therapeutics, might be of benefit in the 
treatment of ovarian, breast or renal cancer. 

There is limited expression of this gene in endocrine/metabolic related tissues. Low 
expression of this gene is seen in adipose, pancreas, reproductive tissues (testes and ovaries) 
and skeletal muscle. Therefore, therapeutic modulation of this gene and/or its protein product 

25 may prove useful in the treatment of different endocrine/metabolic diseases, such as diabetes 
and obesity. Please refer to Panel 1.5 for a synopsis of the function of the MLK2 homolog. 

Panel 2.2 Summary: Ag2872 Expression of the CG56071-01 gene is highest in a 
sample derived from normal kidney tissue adjacent to a kidney cancer (CT = 31 .2). In 
addition, there appears to be substantial expression of this gene in samples derived from breast 

30 cancers. Thus, expression of this gene could be used to distinguish normal kidney tissue from 
other tissues in the panel. Moreover, therapeutic modulation of the activity of this gene or its 
protein product, through the use of small molecule drugs, antibodies or protein therapeutics, 
might be of benefit in the treatment of breast cancer. 
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Panel- 2D Summary: Ag2872 Expression of the CG56071-01 gene is highest in a 
sample derived from an 6varian cancer (CT = 28.4). Thus, expression of this gene could be 
used to distinguish ovarian cancer tissue from the other tissues in the panel. In addition, there 
appears to be substantial expression of this gene in samples derived from breast cancers and 
5 well as a small but appreciable difference in expression between a set of colon cancers and 
their respective normal adjacent tissues. Therefore, therapeutic modulation of the.activity of 
this gene or its protein product, through the use of small molecule drugs, antibodies or protein 
therapeutics, might be of benefit in the treatment of breast cancer, ovarian cancer or colon 
cancer. 1 

10 Panel 3D Summary: Ag2872 The expression of the CG56071-01 gene was assessed 

in two independent runs in Panel 3D using one probe/primer pair. The two runs showed ' 
excellent concordance. This gene shows highest expression in a sam'ple derived from a small 
cell lung cancer derived cell line (<pT = 26.1). In addition, there is substantial expression of 
this gene in two other lung cancer derived cell lines and a pancreatic cancer derived cell line. 

1 5 Thus, the expression of this gene could be used to distinguish this small cell lung cancer cell 
line from other samples in the panel. Moreover, therapeutic modulation of the activity of this 
gene or its protein product, through the use of small molecule drugs, antibodies or protein 
therapeutics, might be of benefit in the treatment of lung cancer. 

Panel 4.1D Summary: Ag4847 Expression of the CG56071-01 gene is highest in 

20 kidney (CT = 28.3). This gene is also highly expressed in small airway epithelium treated with 
TNF-a and IL-lb, and to a lower extent in the same non treated tissue and also in the 
mucoepidermoid cell line H292 upon treatment with the Th2 cytokines IL-4 and 11-9, 
cytokines that are responsible for increasing mucus production in this cell line. Furthermore, 
expression of this gene is up-regulated in bronchial epithelium upon TNF-a and IL-1 

25 treatment. Finally, moderate expression of this gene is also seen in activated B cells. 

This gene encodes for a protein with homology to the mixed lineage kinase 2 (MLK2) 
that was reported to activate JNK pathway (ref. 1). Activation of this pathway has been 
associated with many inflammatory reactions in many cell types. Il-lb that is produced during 
airway inflammation has been shown to regulate JNK pathway, for example (ref. 2). 

30 Furthermore, the role of 11-4 and IL-1 3 in airway remodeling appears also to use INK pathway 
(ref. 3). Finally, JNK appears to be required for the production of metalloproteinases (ref. 4), 
molecules that play an important role in inflammatory diseases such as rheumatoid arthritis, 
asthma, and inflammatory bowel disease (IBD). Therefore, modulation of the expression or 
activity of the CG56071-01 gene or its protein product by small molecule drugs could be 
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beneficial for the treatment of inflammatory diseases such as in chronic obstructive pulmonary 
disease, asthma, emphysema and also rheumatoid artliritis/osteoarthritis, BD and psoriasis 
(Hirai S, Noda K, Moriguchi T, Nishida'E, Yamashita A, Deyarna T, Fukuyama K, Ohno.Si 
Differential activation of two JNK activators, MKK7 and SEK1, by MKJNT2 8 -derived 
5 nonreceptor serine/threonine kinase/mixed lineage kinase 2. J Biol Chem 1998 Mar 
27;273(13):7406-12; Hallsworth MP, Moir.LM, Lai D, Hirst SJ. Inhibitors of mitogen- 
activated protein kinases differentially regulate eosinophil-activating cytokine release from 
human airway smooth muscle. Am J Respir Crit Care Med 2001 Aug 15;164(4):688-97; 
Hashimoto S, Gon Y, Takeshita 1, Maruoka S, Hone T. IL-4 and IL-13 induce myofibroblastic 
10 phenotype of human lung fibroblasts through c-Jun NH2-terminal kinase-dependent pathway. 
J Allergy Clin Immunol 2001 Jun; 107(6): 1001 -8; Han Z, Boyle DL, Chang L, Bennett B, 
' Karin M, Yang L, Manning AM, Firestein GS. c-Jun N-terminal kinase is required for 
metalloproteinase expression and joint destruction in inflammatory arthritis. J Clin Invest 2001 
Jul;108(l):73-81). 

15 Panel 4D Summary: Ag2872 The results from this experiment are almost identical to 

what is seen in Panel 4. ID. Expression of the CG56071-01 gene is highest in small airway 
epithelium treated with TNF-a and IL-lb (CT = 27.8). This gene is also expressed to a lower 
extent in untreated small airway epithelium tissue as well as in the muco epidermoid cell line 
H292 upon treatment with the Th2 cytokines IL-4 and 11-9, cytokines that are responsible for 

20 increasing mucus production in this cell line. Furthermore, expression of this gene is up- 
regulated in bronchial epithelium upon TNF-a and IL-1 treatment. Finally, moderate 
expression of this gene is also seen in activated B cells. 

This gene encodes for a protein with homology to the mixed lineage kinase 2 (MLK2) 
that was reported to activate JNK pathway (ref. 1). Activation of this pathway has been 

25 associated with many inflammatory reactions in many cell types. II- lb that is produced during 
airway inflammation has been shown to regulate JNK pathway, for example (ref. 2). 
Furthermore, the role of 11-4 and IL-13 in airway remodeling appears also to use JNK pathway 
(ref. 3). Finally, JNK appears to be required for the production of metalloproteinases (ref. 4), 
molecules that play an important role in inflammatory diseases such as rheumatoid arthritis, 

30 asthma, and inflammatory bowel disease (L3D). Therefore, modulation of the expression or 
activity of the CG56071-01 gene or its protein product by small molecule drugs could be 
beneficial for the treatment of inflammatory diseases such as in chronic obstructive pulmonary 
disease, asthma, emphysema and also rheumatoid arthritis/osteoarthritis, IBD and psoriasis 
(Hirai S, Noda K, Moriguchi T, Nishida E, Yamashita A, Deyama T, Fukuyama K, Ohno S. 
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Differential activation of two JNK activators, MKK7 and SEK1, by MKN28-derived 
" nonreceptor serine/threonine kinase/mixed lineage kinase 2. J Biol Chem 1998 Mar 
27;273(13):7406-12; Hallsworth MP, Moir LM, Lai D; Hirst SJ. Inhibitors of mitogen- 
activated protein kinases differentially regulate eosinophil-activating cytokine release from 
5 human airway smooth muscle. Am J Respir Crit Care Med 2001 Aug 15;164(4):688-97; 

Hashimoto S, Gon Y, Takeshita I, Maruoka.S, Horie T. IL-4 and IL-13 induce myofibroblast! c 
phenotype of human lung fibroblasts through c-Jun NH2-terminal kinase-dependent pathway, 
J Allergy Clin Immunol 2001 Jun; 107(6): 1001 -8; Han Z, Boyle DL, Chang L, Bennett B, 
Karin M, Yang L, Manning AM,'Firestein GS. c-Jun N-terminal kinase is required for 
1 0 metalloproteinase expression and joint destruction in inflammatory arthritis. J Clin Invest 2o6l 
Jul;108(l):73-81). 

1 ■ Panel 5 Islet Summary: Ag2872 The CG56071-01 gene is expressed at low to 
moderate levels in pancreatic islet cells and placenta in panel 51. Please refer to Panel 1.5 for a 
synopsis of the potential function of this MLK2-like gene in endocrine and metabolic 
15 disorders. 

Panel CNS_1 Summary: A&2872 This panel confirms the low to moderate 
expression of the CG56071-01 gene in the CNS in an independent group of patients. 
H. NOV10: INTERLEUKIN-1 LIKE PROTEIN 1 

Expression of the NOV10 gene (CG56083-01) was assessed using the primer-probe 
20 sets Ag2909 and Ag4936, described in Tables 64-65 . 



Table 64 . Probe Name Ag2909 



Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 1 ~gcatgtgtggagacagaagag-3 ' (SEQ ID NO:226) 


21 


205 


Probe 


TET-5 1 ~ cttccctacagctggagcagccagt - 3 1 -TAMRA {SEQ ID 
NO:227) 


25 


230 


Reverse 


5 ' -accaagatagagctccatgatg-3 * (SEQ ID N0:228) 


22 


258 


Table 65. Probe Name Ag4936 


Primers; 


Sequences 


Length, 


Start 
Position 


Forwardjs ' -ggcatgtgtggagacagaag-3 ' (SEQ ID NO:229) 


20 | 


204 


' |TET-5 ■ -cttccctacagctggagcagccagt-3 • -TAMRA (SEQ ID: 
ProbS |NO:230) 


25 


230 


Reversej5 ' -ccaagatagagctccatgatgt-3 1 (SEQ ID NO:231) 


22 : 


257 



CNSneurodegenerationvl.O Summary: Ag2909 Expression of this gene is 
25 low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). Probe 
failure is also a possibility. 
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General_screeningjpanel_vl.4 Summary: Ag4936 Expression of this gene is 
low/undetectable (CTs >35) across all of the samples on this panel '(data not shown). Probe 
failure is also a. possibility. i , r 

Panel 1.3D Summary: Ag2909 Expression of this gene is low/undetectable (CTs > 
5 35) across all of the samples on this panel (data not shown). Probe failure is also a possibility. 

Pane) 2D Summary: Ag2909 Expression of this gene is low/undetectable (CTs > 35) 
across all of the samples on this nanel (data not shown). Probe failure is also a possibility. 

Panel 4.1D Summary: Ag4936 Expression of this gene is low/undetectable (CTs > 
35) across all of the samples on this panel (data not shown). Probe failure is also a possibility. 
10 Panel 4D Summary: Ag2909 Expression of this gene is low/undetectable (CTs > 35) 

across all of the samples on this panel (data not shown). Probe failure is also a possibility. 

i 

I. NO VI 1 : INTERJLEUKIN 1 SIGNAL TRANSDUCER-LIKE (also known as TRAF6-like) 

Expression of gene CG^6093-01 was assessed using the primer-probe set Ag2S89, 
1 5 described in Table 66. Results of the RTQ-PCR runs are shown in Tables 67-71 . 
Table 66 . Probe Name Ag2889 



Primers 


Sequences 


Length 


Start 
Position 


Forward? 


5 • -gcaaaaccacgaagagataatg-3 ' (SEQ ID NO: 232) 


22* 


1346 


Probe 


TET-5 1 -caaaccagagctgcttgctttccag-3 * -TAMRA (SEQ ID 
N0:233) 


25 


1373 


Reverse 


5 ' -ttacatagccaaaaccttttgg-3 ' (SEQ ID NO:234) 


22 


1419 



Table 67 . CNS_neurodegeneration_vl.O 



Tissue Name 


Reh Exp.(%) Ag2889, Run 
224116293 


Tissue Name 


Rel. Exp.(%) Ag2889, Run 
224116293 


AD 1 Hippo 


17.9 


Control (Path) 3 
Temporal Ctx 


4.3 


AD 2 Hippo 


38.4 


Control (Path) 4 
Temporal Ctx 


39.8 


AD 3 Hippo 


11.0 


AD 1 Occipital Ctx 


15.3 


AD 4 Hippo 


9.0 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


87.1 


AD 3 Occipital Ctx 


7.6 


AD 6 Hippo 


54.7 


AD 4 Occipital Ctx 


12.8 


Control 2 Hippo 


13.6 


AD 5 Occipital Ctx 


13.6 


Control 4 Hippo 


9.9 


AD 6 Occipital Ctx 


35.6 


Control (Path) 3 Hippo 


9.0 


Control 1 Occipital Ctx 


3.6 


AD 1 Temporal Ctx 


32.5 


Control 2 Occipital Ctx 


40.1 


AD 2 Temporal Ctx 


40.3 


Control 3 Occipital Ctx 


21.3 


AD 3 Temporal Ctx 


11.3 


Control 4 Occipital Ctx 


7.4 


AD 4 Temporal Ctx 


33.2 


Control (Path) 1 
Occipital Ctx 


100.0 
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AD 5 Inf Temporal Ctx 


99.3 

- 


Control fPatm 2 
Occipital Ctx 


26.1 


j\u d »~>up i emporai 
Ctx 


5,4.0 


Control (Path) 3 
Occipital Ctx 


/, 2.P 


AD 6 Inf Temporal Ctx 


49.7 


Control (Path) 4 
Occipital Ctx ■ 


27.0 


Ctx 


51.4 


Control 1 Parietal Ctx 


10.2 


Control 1 Temporal Ctx 


11.1 


Control 2 Parietal Ctx 


17.0 


Control 2 Temporal Ctx 


17.7 


Control' 3 Parietal Ctx 


29.1 ., 


Control 3 Temporal Ctx 


21.9 


Control (Patli) 1 
Parietal Ctx 


32.5 


Control 3 Temporal Ctx 


10.3 


Control (Path) 2 
Parietal Ctx 


42.0 


Control (Path) 1 
Temporal Ctx* 


70.7 ; 


Control (Path) 3 
Parietal Ctx 


4.3 


Control (Path) 2 
Temporal Ctx 


48.6 


Control (Path) 4 
Parietal Ctx i 


51.4 



Table 68 . Panel 1.3D 



Tissue Name 


■ ■ ' — • — | 

Reh Exp.'(%) Ag2889, Run 
155754402 I 


Tissue Name' 


Rel. Exp r (%) Ag2889, Run 
155754402 


Liver adenocarcinoma 


5.9 


Kidney (fetal) 


15.2 


Pancreas 


o c \ 

O.J ( 


Kenai ca. /oo-u 


10 1 


Pancreatic ca. CAP AN 2 > 


3.1 1 


Renal ca. A498 


53.2 


Adrenal gland 


9.4 


Renal ca. RXF 393 


6.9 


Thyroid 


16.4 


Renal ca. ACHN 


5.0 


Salivary gland 


8.1 ! 


Renal ca.UO-31 


16.8 


Pituitary gland 


17.7 ; 


Renal ca. TK-10 


8.0 


Brain (fetal) 


10.2 


Liver 


4.3 


Brain (whole) ■ 


18.7 | 


Liver (fetal) 


^ 9.2 


Brain (amygdala) 


16.6 1 


Liver ca. (hepatoblast) ! 
HepG2 


n.i 


Brain (cerebellum) 


9.7 


Lung 


26.6 


Brain (hippocampus) 


74.7 


Lung (fetal) 


13.7 


Brain (substantia nigra) 


8.4 i 


Lung ca. (small cell) LX- : 
1 


17.6 


Brain (thalamus) 


15.2 \ 


Lung ca. (small cell) 
NCI-H69 


37.1 


Cerebral Cortex 


50.7 


Lung ca. (s.cell var.) 
SHP-77 


30.1 


Spinal cord 


13.5 


Lung ca. (large cell)NCI- 
H460 


2.9 


glio/astro U87-MG 


18.7 


Lung ca. (non-sm. cell) 
A549 


21.3 


glio/astro U-118-MG 


56.6 


Lung ca. (non-s.cell) 
NCLH23 


14.8 


astrocytoma SW1783 


16.8 


Lung ca. (non-s.cell) 
HOP-62 


8.5 


neuro*; met SK-N-AS 


100.0 


Lung ca. (non-s.cl) NCI- 
H522 


7.5 


astrocytoma SF-539 


17.6 


Lung ca. (squam.) SW 


3.8 
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j|900 




astrocytoma SNB-75 


11.1 


iLung ca. (squam.) NCI- 
JH596 


8.8 


glioma SNB-19 


27.0 


jMammary gland 


21.5 


i — — ■— - 

glioma U251 


18,8 


— _ — — -p* — • ■—■ — — * — — ~ — — r 

Breast ca.*, (pl.ef) MCF- . 

I 7 


25.0 


glioma SF-295 


8.7 


jBreast ca.* (pl.ef) MDA- 
jMB-231 


72.2 


Heart (fetal) 


8.1 


jBreast ca.* (pl.ef) T47D 


14.1 


Heart 


4.8 


\t~\ TIT* C A c\ 

jBreast ca. BT-549 


30.4 


Skeletal muscle (fetal) 


49.0 


|Breast ca. MDA-N 




Skeletal muscle 


4.8 


|Ovary 


10 0 


Bone marrow 


' 9.5 


jovarian ca. OVCAR-3 


T\ 7 


Thymus 


21.8 


jOvarian ca. OVCAR-4 


3.0 1 


Spleen 


32.5 


jOvarian ca. OVCAR-5 


22.7 


Lymph node 


9.3 


!|Ovarian ca. OVCAR-8 ! 


17.4 


Colorectal 


15.9 


jOvarian ca. IGROV-l 


5.0 


oiu 11 ia\/ xx 


13.7 


: Ovarian ca.* (ascites) : 
JSK-OV-3 


31.6 


Small intestine 


20.0 


(Uterus 


8.0 


Colon ca. SW480 


34.2 


[Placenta 


17.2 


Colon ca.* SW620(SW480 
met) 


21.0 


jprostate 


4.5 


V-^CJiUH t-a. xx 1 £*y 


12.9 


[Prostate ca.* (bone 
|met)PC-3 


16.3 


Colon ca. HCT-116 


15.5 


jTestis 


27.5 


Colon ca. CaCo-2 


18.8 


(Melanoma Hs688(A).T 


6.3 


Colon ca. 
tissue(OD03866) 


23.5 


jMelanoma* (met) 
|Hs688(B).T 


4.1 


Colon ca. HCC-2998 


46.7 


Melanoma UACC-62 


2.5 


Gastric ca.* (liver met) 
NCI-N87 


34.4 


JMelanoma M14 


4.0 


Bladder 


20.0 


jMelanoma LOX IMVI 


18.6 


Trachea 


22.5 


jMelanoma* (met) SK- 
]MEL-5 


18.9 


Kidney 


5.1 


jAdipose 


10.5 



Table 69 . Panel 2.2 



Tissue Name 


Rel. Exp.(%) Ag2889, 
Run 175119377 


Tissue Name 


Rel. Exp,(%) Ag2889, 
Run 175119377 


Normal Colon 


19.5 • 


Kidney Margin (OD04348) 


100.0 


Colon cancer (OD06064) 


31.9 


Kidney malignant cancer 
(OD06204B) 


7.0 


Colon Margin (OD06064) 


11.9 


Kidney normal adjacent 
tissue (OD06204E) 


14.5 


Colon cancer (OD06159) 


1.8 


Kidney Cancer (OD04450- 
01) 


81.2 


Colon Margin (OD06159) 


12.9 


Kidney Margin (OD04450- 
03) 


24.8 


Colon cancer (OD06297-04) 


6.6 


Kidney Cancer 8120613 


3.0 


Colon Margin (OD06297- 


27.4 


Kidney Margin 8120614 


6.4 
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015) 




1 


• 


CC Gr.2 ascend colon i 
(OD03921) 


0.4 


Tf-irliVov CnnrPT 0010390 




CC Margin (OD03921) ' 


1 5.1 


Kidney Margin 9010321 




Colon cancer metastasis 
(OD06104) 


6.7 


Kidney Cancer 8120607 


5.6 


Lung Margin (OD061 04) j 


11.3 


Kidney Margin 8120608 


2,6 


Colon mets to lung 
(OD04451-01) 


16.0 


Normal Uterus 


, 39.0 


Lung Margin (OD04451-02) 


, 20.7 


Uterine Cancer 064011 


10.8- 


Normal Prostate 


4,1 


Normal Thyroid 


2.9 


Prostate Cancer (OD044 1 0) 


3.4 


Thyroid Cancer 064010 


3.8 


Prostate Margin ("00044 10) 


1A 


Thyroid Cancer A302152 


12.8 


KTnrrno 1 f *\ -i mrt t ' 

INUlJTkll wvdiy 


8.4 


Thyroid Margin A302153 


0.0 


Ovarian cancer (OD06283- 


6.2 


Normal Breast 


31.2 , 


Ovarian Margin (OD06283- ; 

U/) 


10.7 


Breast Cancer (OD04566) 


3.6 


r\i;?inan PanrAr C\f^AC\f\Q 

L/vanan cancer \joh uuo ; 


1 1 7 
11,/ 


Rrrfl«:t Cancer 1024 


4.3 


Ovarian cancer (OD06 145) 

. ■ - ■ — -■ ! — ^ i 


1.8 


Breast Cancer (OD0ft590- 

on 

VJl ) 


, 13. 1 


Ovarian Margin (OD06145) 


20.2 


(OD04590-03) 


46.7 


Ovarian cancer (UJUUo4;o- 
03) 


11.7 


(OD04655-05) 


24.1 


Ovarian Margin (OD06455- | 
07) 


. 9.3 


Breast Cancer 064006 


13.0 


Normal Lung 


15.4 


Breast Cancer 9100266 


13.9 


Invasive poor diff. lung 
adeno (ODO4945-01 


5,4 


Breast Margm 9 1 00265 


O 1 

y.5 


Lung Margin (ODO4945-03) 


15.7 


Breast Cancer A209073 


7.0 


Lung Malignant Cancer 
(OD03126) 




Breast Marpin A2090734 

iJibUoL lYAfll Kill ifc^-V/ \s\J t *s » 


21.9 


Lung Margin (OD03 126) 


6.3 


Breast cancer (OD06083) 


30.4 




7.0 


Breast cancer node 
metastasis (OD06083) 


28.5 


Lung Margin (OD05014B) 


23.5 


Normal Liver 


19.3 


Lung cancer (OD06081) 


7.6 


Liver Cancer 1026 


1.3 


Lung Margin (OD0608 1) 


20.9 


Liver Cancer 1025 


12.2 


Lung Cancer (OD04237-01) 


8.0 


Liver Cancer 6004-T 


7.7 


Lung Margin (OD04237-02) 


32.8 


Liver Tissue 6004-N 


1.6 


Ocular Melanoma Metastasis 


30.6 


Liver Cancer 6005-T 


3.1 


Ocular Melanoma Margin 
(Liver) 


4.9 


Liver Tissue 6005-N 


18,3 


Melanoma Metastasis 


9.7 


Liver Cancer 064003 


7.8 


Melanoma Margin (Lung) 


19.1 


Normal Bladder 


12.9 


Normal Kidney 


20.0 


Bladder Cancer 1023 


1.3 


Kidney Ca, Nuclear grade 2 
(OD04338) 


41.5 


Bladder Cancer A3 02 1 73 


7.9 


Kidney Margin (OD04338) 


14.9 


Normal Stomach 


37.6 


Kidney Ca Nuclear grade 1/2 


49.0 


Gastric Cancer 9060397 


1.0 
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|(OD04339) 








jKidney Margin (OD04339) 


13.8 S 


;tomach Margin 9060396 


5.4 


jKidney Ca, Clear cell type 
|(OD04340) 


22.1 , ■ C 


3astric Cancer 90603?5 


— r 

7.4 


jKidney Margin (OD04340) 


28.9 S 


>tomach Margin 9060394 


22.4 


jKidney Ca, Nuclear grade 3 
j(OD04348) 


6.7 C 


Sastric Cancer 064005 


9.0 


Table 70. Panel 2D 


Tissue Name 


Re). Exp.(%) Ag2889, 
Run 158095068 


i Tissue Name 


Rel. Exp.(%) Ag2889, 
Run 158095068 


Normal Colon 


75.3 


Kidney Margin 8120608 


7.8 


CC Well to Mod Diff 
(OD03866) 


/.y 


isjaney cancer o l zuo l o 


o 


6c Margin (OD03866) 


■12.1 


Kidney Margin 8120614 


, : 1 

7.5 


CC Gr.2 rectosigmoid 
(OD03868) 


7.4 


Kidney Cancer 9010320 


15.3 


CC Margin(OD03868) 


4.2 


Kidney Margin 9010321 


10.4 


CC Mod Diff (ODO3920) 


37.1 


Normal Uterus 


9.7 


CC Margin (ODO3920) 


17.0 


Uterus Cancer 064011 


30.8 


CC Gr.2 ascend colon 
(OD03921) 


37.9 


Normal Thyroid 


22.8 


CC Margin (OD03921) 


9.9 


Thyroid Cancer 064010 


15.7 


CC from Partial Hepatectomy 
(ODO4309) Mets 


28.5 


Thyroid Cancer A302152 


15.4 


Liver Margin (ODO4309) 


21.0 


Thyroid Margin A302153 


36.3 


Colon mets to lung (OD0445 1- 
01) 


ten 


Normal Breast 


is. i 


Lung Margin (OD04451-02) 


10.8 


Breast Cancer (OD045 66) 


13.4 


Normal Prostate 6546-1 


9.6 


Breast Cancer (OD04590- 
01) 


100.0 


Prostate Cancer (OD04410) 


26.4 


Breast Cancer Mets 


82.4 


Prostate Margin (OD04410) 


29.9 


Breast Cancer Metastasis 
(OD04655-05) 


31.6 


Prostate Cancer (OD04720-01) 


34.6 


Breast Cancer 064006 


16.0 


Prostate Margin (OD04720-02) 


47.3 


Breast Cancer 1024 


19.6 


Normal Lung 061010 


50.7 


Breast Cancer 9100266 


24.0 


Lung Met to Muscle 
(OD04286) 


11.0 


Breast Margin 9 1 00265 


9.7 


Muscle Margin (OD04286) 


15.5 


Breast Cancer A209073 


38.7 


Lung Malignant Cancer 
(OD03126) 


28.1 


Breast Margin A2090734 


18.8 


Lung Margin (OD03126) 


39.0 


Normal Liver 


10.2 


Lung Cancer (OD04404) 


28.3 


Liver Cancer 064003 


9.0 


Lung Margin (OD04404) 


14.4 


Liver Cancer 1025 


9.2 


Lung Cancer (OD04565) 


15.0 


Liver Cancer 1026 


4.5 


Lung Margin (OD04565) 


14.5 


Liver Cancer 6004-T 


16.5 


Lung Cancer (OD04237-01) 


51.4 


Liver Tissue 6004-N 


11.8 


Lung Margin (OD04237-02) 


20.7 


Liver Cancer 6005-T 


7.2 


Ocular Mel Met to Liver 


62.9 


Liver Tissue 6005-N 


4.5 
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(ODO4310) 








Liver Margin (0DO43 10) 


11.3 


Normal Bladder 


66.9 


iVX^kalUJllUa IVJLCli* IU JL>Ullg 

(OD04321) 


15.9. 


Bladder Cancer ,1023 


3.0 




39.8 


Bladder Cancer A302173 


27.7 


Normal Kidney 


69.7 


Bladder Cancer 
(OD047 18-01) 


39.2 


(OD04338) 


42.3 


Bladder Normal Adjacent 
(OD047 18-03) 


18.0 


XViUJlCy lVlulglH ^WJLVv l t j DO J 




Nnrmal Ovarv i 

-l ~ l j ixi i woi y i 


6.5 


JV-lvLlJC-V V-'tl ilUVJt'flJl glCIlJE/ J / 

(OD04339) 


62.4 


Ovarian Cancer 064008 


42.9 


Kidney Margin (OD04339) 


■ 

59.9 


Ovarian Cancer 
(OD04768-07) 


J /.0 

- 1. 


Kidney Ca, Clear cell type 
(OD04340) 


77.4 


Ovary Margin (OD04768- 
08) 


8.5 


Kidney Margin (OD04340) 


46.0 


Normal Stomach 


20.4 


Kidney Ca, Nuclear grade 3 
(OD04348) 


10.1 


Gastric Cancer 9060358 


8.3 


Kidney Margin (OD04348) 


33.9 


Stomach Margin 9060359 


12.7 


Kidney Cancer (OD04622-01) 


9.0 


Gastric Cancer 9060395 


27.9 


Kidney Margin (OD04622-03) 


2.8 


Stomach Margin 9060394 


14.6 


Kidney Cancer (OD04450-01) 


27.4 


Gastric Cancer 9060397 


14.2 


Kidney Margin (OD04450-03) 


45.4 


Stomach Margin 9060396 


5.0 


Kidney Cancer 8120607 


6.5 


Gastric Cancer 064005 


76.8 



Table 71 . Panel 4D 



Tissue Name 


ReL Exp.(%) Ag2S89, 
Run 158095071 


Tissue Name 


ReL Exp.(%) Ag2885>, 
Run 158095071 


Secondary Thl act 


22.8 


HUVEC IL-lbeta 


11.9 


Secondary Th2 act 


23.8 


HUVEC IFN gamma > 


17.3 


Secondary Trl act 


20.4 


HUVEC TNF alpha + IFN 
gamma 


21.0 


Secondary Thl rest 


9.5 


HUVEC TNF alpha + IL4 


17.4 


Secondary Th2 rest 


13.4 


IiUVEC 1L-11 


7.5 


Secondary Trl rest 


10.1 


Lung Microvascular EC none 


6.9 


Primary Thl act ; 


32.5 


Lung Microvascular EC 
TNFalpha + IL^lbeta 


20.6 


Primary Th2 act 


29.5 


Microvascular Dermal EC none 


21.5 


Primary Trl act 


39.8 l 


Microsvasular Dermal EC j 
TNFalpha + IL- 1 beta 


17.8 


Primary Thl rest 


55.5 


Bronclrial epithelium TNFalpha 
+ ILlbeta 


14.4 


Primary Th2 rest 


25.7 


Small airway epithelium none 


14.1 


Primary Trl rest 


21.6 


Small airway epithelium 
TNFalpha +IL-1 beta 


100,0 


CD45RA CD4 lymphocyte 
act 


12.6 


Coronery artery SMC rest 


15.9 


CD45RO CD4 lymphocyte 
act 


33.9 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


6.9 


CD8 lymphocyte act 


13.2 


Astrocytes rest 


13.6 
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Secondary CD8. 
lymphocyte rest 


13.5 


jAstrocytes TNFalpha + IL-lbeta 


15.7 ' 


Secondary CD8 
lymphocyte act 


9.3 

i 


HOT-SI 9 fftasonhin rest 


91 0 


y-syj-t ty iiijpxiuv/ y iw iivjuc 


18.8 


KU-8 12 (Basophil) 
jPMA/ionomycin . 




2ryThl/Th2/Trl anti- 
CD95 CH11 


13.6 


JcCDl 106 (Keratinocytes) none 


13.2 


LAIC cells rest 


30.6 


jCCDl 106 (Keratinocytes) 
jTNFalpha + IL-lbeta 


■ 7.3 


LAK cells IL-2 


28.1 


1 It ■ ■ t. ' • 

JLiver cirrhosis 


6.9 


LAK cells IL-2+IL-12 


21.0 


It i -J 

|Lupus kidney 


4,9 


LAK cells IL-2+IFN 
i 


28.3 


|nCI-H292 none 


58.6 


LAKcellsIL-2+IL-18 ' 


42.9 


(NCI-H292 IL-4 , 
i . ...... .. . 


42.6 


LAK cells 
PMA/ionomycin 


14.1 


NCI-H292 IL-9 

i 


72.2 . 


XTV r^ollr. TT 1 fact 

injpv. v_,eus il,-jL rest 




|"Mr , T TJTOQO TT 11 




i wo way Miviv j ciay 




l"M/^T T-T009 TT7M nnrr-trnn 

^jiNci*-rizy-4 iris gamma 


1A 3 


i wo Way JVlJLK d day 




jriJKA.DC none 




Two Way MLR 7 day 


' 4.7 


|HPAEC TNF alpha + IL-1 beta ' 


23.5 


PBMC rest 


14.1 


jLung fibroblast none 


7.2 


PBMC PWM 


68.3 


- jLung fibroblast TNF alpha + IL- 
|l beta 


7.0 


PBMC PHA-L 


19.8 


jLung fibroblast IL-4 


21.9 


Ramos (B cell) none 


17.3 


JLung fibroblast IL-9 


8.4 


Ramos (B cell) ionomycin 


59.5 


jLung fibroblast IL-13 


12.4 


B lymphocytes PWM 


55.5 


|Lung fibroblast EFN gamma 


28.1 


B lymphocytes CD40L 
and IL-4 


32.1 


jDermal fibroblast CCD 1070 rest; 


24.3 


EOL-1 dbcAMP 


13.6 


Dermal fibroblast CCD 1070 

j TXT tT olnltd 

^ lJNr aipna 


54.7 


PTfYT 1 ^Kr>AAyfP 
JCAJJU-I QDCJ\Nlr 

PMA/ionomycin 


37.4 


jjermai iioroDiasi ccluu/v il>- 
jl beta 


10.9 


Dendritic cells none 


15.9 


jDermal fibroblast IFN gamma 


11.7 


Dendritic cells LPS 


14.1 


|Dermal fibroblast IL-4 


21,5 


Dendritic cells anri~CD40 


11.7 


jlBD Colitis 2 


1.8 


Monocytes rest 


21.2 


jlBD Crohn's 


3.0 


Monocytes LPS 


21.6 


jColon 


28.3 


Macrophages rest 


21.0 


]Lung 


20.2 


Macrophages LPS 


16.4 


(Thymus 


47.0 


HUVEC none 


18.0 


jKidney 


100.0 


HUVEC starved 


42.0 j 





CNSneurodegenerationvl.O Summary: Ag2889 This experiment confirms the 
expression of the CG56093-01 gene at moderate level in the CNS in an independent group of 



patients. However, no differential expression of this gene was found between Alzheimer's 
disease and control postmortem brains. Please see Panel 1.3D for a discussion of utility of this 
5 gene in the central nervous system. 
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Panel' 1. 3D Summary: Ag2889 Expression of the CG56093-01 gene is highest in a 
sample derived from brain cancer cell line SK-N-AS (CT = 27.9). Thus, expression of this 
gene could be used to distinguish SK-N-AS cell derived samples from other samples in the 
panel. In addition, there is substantial expression of this gene in samples derived from other 
brain cancer cell lines as well as in breast cancer and renal cancer cell lines. Therefore, 
therapeutic modulation of the activity of this gene or its protein product, through the use of 
small molecule drugs, antibodies' or protein therapeutics, might be beneficial in the treatment 
of brain cancer, breast cancer and renal cancer. 

This gene is also .expressed at moderate levels in all regions of the central nervous 
system examined, including amygdala, cerebellum, hippocampus, substantia nigra, thalamus, 
cerebral cortex and spinal cord (CTs = 28.3-31.4). This gene encodes a protein with homology 
to interleukin 1 signal transducer, also known as TRAP6. Interleukin 1 signal transducer is an 
important molecule involved in different aspects of cellular regulation, including 
inflammation. Inflammation has been implicated' in the pathophysiology of Alzheimer's 
disease, stroke, and spinal cord and brain trauma. Therefore, therapeutic modulation of this 
gene or its protein product may be beneficial in reducing the neuronal death associated with 
any of these conditions. 

In addition, the CG5 6093-01 gene expressed at moderate levels in several metabolic 
and endocrine tissues, including adipose, adrenal gland, liver, pancreas, skeletal muscle and 
thyroid. Therefore, therapeutic modulation of the activity of this gene or its protein product 
could be beneficial in the treatment of endocrine or metabolic diseases, such as obesity and 
type II diabetes. Interestingly, this gene is expressed at higher levels in fetal skeletal muscle 
(CT = 28.9) than in adult skeletal muscle (CT = 32.2), suggesting that expression of this gene 
can be used to distinguish these tissues. In addition, the relative overexpression of this gene in 
fetal skeletal muscle suggests that the protein product may enhance muscular growth or 
development in the fetus and thus may also act in a regenerative capacity in the adult. 
Therefore, therapeutic modulation of the protein encoded by this gene could be useful in 
treatment of muscle related diseases. More specifically, treatment of weak or dystrophic 
muscle with the protein encoded by this gene could restore muscle mass or function. 

Panel 2.2 Summary: Ag2889 Expression of the CG56093-01 gene is highest in a 
sample derived from a sample of normal tissue adjacent to a kidney cancer (CT - 32T). Thus, 
expression of this gene could be used to distinguish this normal kidney tissue sample from 
other samples in the panel. In addition, there is low but significant expression of this gene in 
samples derived from kidney cancer, normal stomach tissue, normal uterus tissue, normal 
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breast tissue and breast cancer tissue. Therefore, therapeutic modulation of the activity of this 
" gene or its protein product, through the use of small molecule drugs, protein therapeutics or ■ 
antibodies, might be beneficial in the treatment of kidfiey cancer of breast cancer. 

Panel 2D Summary: Ag2889 Expression of the CG56093-01 gene is highest in a 
5 sample derived from breast cancer tissue (CT = 28.7). Thus, the expression of this gene could 
be used to distinguish breast cancer tissue sample from other samples in the panel. In addition, 
there is substantial expression of this gene in samples derived from other breast cancer tissues, 
kidney cancer, gastric cancer tissue, normal bladder tissue, and normal colon tissue. Therefore, 
therapeutic modulation of the activity of this gene or its protein product, through the use of 

10 ^m^ll molecule drugs, protein therapeutics or antibodies, might be beneficial in the treatment 
of kidney cancer, gastric cancer or breast cancer. 

Panel 4D Summary: Ag2889 The CG56093-01 gene is expressed at moderate 
levels in the majority of samples on this panel. However, expression of this gene is 
highest in small airway epithelium treated with TNF-alpha and IL-1 beta (CT = 27). 

15 This gene encodes a protein with homology to IL-1 signal transducer protein (also 

known as TRAF6), a protein involved in IL-1 and TNF receptor signaling. Therefore, 
modulation of the expression or activity of this protein by small molecule drugs could 
block the functions of B cells, T cells, and monocytes as well as block the generation 
of inflammatory cytokines from damaged lung epithelium, leading to the improvement 

20 of symptoms of patients suffering from autoimmune and inflammatory diseases such as 
asthma, chronic obstructive pulmonary diseases, emphysema, allergies, inflammatory 
bowel disease, lupus erythematosus, or rheumatoid arthritis. 
I. NOV13: Glucuronosyltransferase 

Expression of gene CG56097-01 was assessed using the primer-probe set Ag2907, 

25 described in Table 72. Results of the RTQ-PCR runs are shown in Tables 73-76. 
Table 72 . Probe Name Ag2907 



Primers 


Sequences 






jl/engthj 


Start 
Position 


Forward 


5 1 -ggctcattcgaaactactggta-3 1 (SEQ 


ID 


N0;235) 


1 j 


773 


Probe 


TET-5 • -tggaatttcctcgcccactcttacct- 
ID NO:236) 


3 1 • 


-TAMRA (SEQ 


| 26 l 


797 


Reverse; 


5 ' -ggttgacaggtttgcagtagag-3 ■ (SEQ 


ID 


NO:237) 


1 « j 


844 


Table 73. Panel 1.3D 



Tissue Name 


1 UeLExp.(%) : 
Ag2907,Run ; 
j 157283423 


Rel.Exp.(%) 
Ag2907,Run 
165701505 


Tissue Name 


Rel Exp.(%) 
Ag2907, Run 
157283423 


Rel. Exp.(%) 
Ag2907, Run 
165701505 


Liver 


j 0.0 


0.0 


Kidney (fetal) 


0.0 


0.0 
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adenocarcinoma 




J 






Pancreas 


! 0.0 


0.0 


{Renal ca. 786-0 


0.0 


0.0 


Pancreatic ca. 
CAP AN 2 


0.0 


•0.0 


jltenal ca. A498 ■ 


0.0 


0.0 


Adrenal gland 


0.0 


0.0 


[Renal ca. RXF , 
(393 ■ 


0.0 


0.0 


Thyroid 


0.0 


0.0 


[Renal ca. ACHN 


0.0 


0.0 


Salivary gland 


0.0 


0.0 


iRenal ca UO-31 


0.0 


0.0 


Pituitary pland 


100.O 


100.0 


jRenal ca. TK-10 


2.0 


28.3 


Brain (fetal) 


0.0 


0.0 


It -j ' 

JLiver 


0.0 


0.0 


Brain (whole) 


0.0 


14.3 


jLiver (fetal) 


A A 

U.U 


a n 
U.U 


Brain (amygdala) 
i 

■ 


1 

0.0 


0.0 


[Liver ca. ' 

(hepatoblast) 

|HepG2 


A A 

U.U 


fi A 

1 


Brain (cerebellum) 


0.0 


11.9 


(Lung ; 


0.0 


U.O 


Brain (hippocampus) 


1.9 


0.0 


|Lung (fetal) 


0.0 


0.0 


Brain (substantia 
nigra) 


0.0 j 


u.u 


jLung ca. (small 
(cell) LX-1 


0.0 


0.0 


Brain (thalamus) 


0.0 


a (\ 
u.u 


[Lung ca. (small ' 
(dell) NCI-H69 I 


1.1 


0.0 


Cerebral Cortex 


2.0 


n a 
u.u 


JLung ca, (s.cell 
jvar.)SHP-77 


0.0 


0.0 


Spinal cord 


1.3 ; 


a a 
u.u 


jLung ca. (large 
|cell)NCI : H460 ; 


0.0 


0.0 


glio/astroTJ87*-MG 


0.0 


12.2 


Lung ca. (non- 
Ism, cell) A549 i 


0.0 


0.0 


glio/astroU-118- j 
MG 


0.0 


a a 
u.u 


iLung ca. (non- 
|s.cell) NCI-H23 


0.0 


0.0 


astrocytoma 
SW1783 


0.0 


u.u 


jLung ca. (non- 
jsxell) HOP-62 


0.0 


0.0 


neuro*; met SK-N- 
AS 


1.0 : 


0.0 


jLung ca. (non- I 
|s.cl)NCI-H522 1 


0.0 


0.0 




I A \ 


0.0 


Lung ca. 

[(squam.) SW 900; 


0.9 


0.0 


astrocytoma SNB-75 


1.0 


0.0 


[Lung ca. 
(squam.) NCI- 
(H596 


0.0 


0.0 


glioma SNB-19 


1.8 


0.0 


(Mammary gland 


0.0 


0.0 


pHnmn TT2S1 


0.8 




jBreast ca.* 
[(pl.ef) MCF-7 


1.4 


2.5 


glioma SF-295 


1.0 


0.0 


Breast ca.* 
(pl.ef) MDA- 

"NyftJ Oil 


2.6 


0.0 


Heart (fetal) 


0.0 


0.0 


[Breast ca.* 
[(pl.ef) T47D 


0.0 


0.0 


Heart 


0.0 


0.0 


Breast ca. BT~ 
[549 


3.3 


0.0 


Skeletal muscle 
(fetal) 


L0 


0.0 


[Breast ca. MDA- 
)N 


0.0 


0.0 


Skeletal muscle 


0.0 


0.0 


jOvary 


0.0 


0.0 


Bone marrow 


0.0 


0.0 


Ovarian ca. 
|0VCAR-3 


2.9 


0.0 


Thymus 


0.0 


0.0 


jOvarian ca. 


0.0 


0.0 
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OVPAR-4 






Spleen 


0.9* 


" 0.0 


OVCAR-5 


' 3.0 


10.9 


Lymph node 

— ^ , _i 


' 0.0 ' 


' o.o 


Ovarian ca. 
OVCAR^8 


o.o. 1 


0.0 


Colorectal 


0.0 


0.0 


Ovarian ca. 
IGROV-1 


0.0 


0.0 


Stomach 


0.0 


0.0 


Ovarian ca.* 
(ascites) SK-OV- 
3 


0.0 


0.0 


Small intestine 


0.0 


' 0.0 


Uterus 


0.0 


o.o ; 


Colon ca. SW480 


, o.o 


0.0 


Placenta 


0.9 


0.0 


Colon ca.* | 
SW620(SW480 met) 


0.0 


0.0 


Prostate 


0.0 


0.0 


Colon ca. HT29 


i.o ; 


0.0 


jTiUolalc- Uli 

(bone met)PC-3 i 


1.3 


0.0 


i—oion ca. riv^ i - 1 1 o 


o.u 


o n 

u.u 


Tactic 




9.5 


Colon ca. CaCo-2 


1.0 


0.0 


Melanoma 

XiSOoo\/\J. I 


1 0.0 


0.0 


Colon ca. | 
tissue(OD03 8 66) 


U.O 

j.- • 


1 ' 

ii a 


Melanoma* 
(met) 

Hs688(B).T j 


f\ fi 
i v.U 


0 0 


Colon ca. HCC-2998^ 


2.6 ! 


0.0 , i 


Melanoma 
UACC-62 


0.0 


0.0 


Gastric ca.* (liver 
met) NCI-N87 


1.2 


13.3 


Melanoma M 14 


0.0 


0.0 


Bladder 


0.0 


6.7 


Melanoma LOX j 
IMVI " 


0.0 


0.0 


Trachea 


0.0 


0.0 


Melanoma* 
(met) SK-MEL-5 


0.0 


0.0 


Kidney j 


0.0 


0.0 


Adipose 


1.5 


0.0 



Table 74 . Panel 2D 



Tissue Name 


Rel. Exp.(%) Ag2907, 
Run 157284121 


Tissue Name 


Rel. Exp.(%) Ag2907, 
Run 157284121 


Normal Colon 


4.5 


Kidney Margin 8120608 


0.0 


CC Well to Mod Diff 
(OD03866) 


2.1 


Kidney Cancer 8120613 


0.0 


CC Margin (OD03866) 


2.3 


Kidney Margin 8120614 


0.0 


CC Gr.2 rectosigmoid 
(OD03868) 


1.7 


Kidney Cancer 9010320 


0.0 


CC Margin (OD03868) 


0.0 


Kidney Margin 9010321 


4.8 


CC Mod Diff (ODO3920) 


0.0 


Normal Uterus 


0.0 


CC Margin (ODO3920) 


0.0 


Uterus Cancer 064011 


1.5 


CC Gr.2 ascend colon 
(OD03921) 


0.0 


Normal Thyroid 


0.0 


CC Margin (OD03921) 


6.8 


Thyroid Cancer 064010 


0.0 


CC from Partial Hepatectomy 
(ODO4309) Mets 


0.0 


Thyroid Cancer A302152 


0.0 


Liver Margin (ODO4309) 


0.0 


Thyroid Margin A302153 


15.2 


Colon mets to lung (OD04451- 
01) . 


0.0 


Normal Breast 


1.7 
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Lung Margin (OD04451-02) 


0.0 


Breast Cancer (OD04566) 


0.0 


Normal Prostate 6546-1 


0.0 . 


Breast Cancer (OD04590- 
01) 


0.0 


Prostate Gancer (OD04410) 


0.0 


k-t- — 1 — 1 — ' — • ^ — 

Breast Cancer Mets 
(OD0459.0-03) 


.2.2 


Prostate Margin (OD04410) 


0.0 


Breast Cancer Metastasis 
(OD04655-05) 


2,» 


Prostate Cancer (OD04720-01) 


100.0 


Breast Cancer 064006 


0.0 


Prostate Margin (OD04720-02) 


0.0 


Breast Cancer 1024 


0.0 . 


Normal Lung 061010 


7.2 


Breast Cancer 9100266 


2.4 


Lung Met to Muscle 
(OD04286) 


0.6 


Breast Margin 9100265 


• 0.0 


Muscle Margin (OD04286) 


0.0 


Breast Cancer A209073 


0.0 


Lung Malignant Cancer 
(Ob03126) 


■ 3.4 


Breast Margin A2090734 


0.0 


Lung Margin (OD03 1 26) 


0.0 


Normal Liver 


0.0 


Lung Cancer (OD04404) 


0.0 


Liver Cancer 064003 


3.5 


Lung Margin (OD04404) 


4.1 


Liver Cancer 1025 


0.0 


Lung Cancer (OD04565) 


0.0 


Liver Cancer 3026 j 


0.0 


Lung Margin (OD04565) 


0.0 


Liver Cancer 6004-T 


0.0 


Lung Cancer (OD04237-0 1 ) 


0.0 


Liver Tissue 6004-N 


0.5 


Lung Margin (OD04237-02) 


0.0 


Liver Cancer'6005~T 


0.0 


Ocular Mel Met to Liver 
(ODO4310) 


0.0 


Liver Tissue 6005-N 


0.0 


Liver MaTgin (ODO4310) 


0.0 


Normal Bladder 


0.0 


Melanoma Mets to Lung 
(OD04321) 


0.0 


Bladder Cancer 1023 


0.0 


T linn A/lorni'ti / T^T ^H/l Q O 1 ^ 

Lung Margin {kjukj^d/. l ) 


0 ft 


Rtarlrlpr Panrpr A ^ft?t73 
JDJaClUCI V^-cUll'Cl AJv//,i / J 


22.1 


Normal Kidney 

~ ( 


0.0 


Bladder Cancer 


2.4 


Kidney Ca, Nuclear grade 2 
(OD04338) 


2.0 


ipiauuvx iNuniioi /\ujdL<cni 
(OD047 18-03) 


0.0 


Jvioney Margin (uuw 5jo) 


ft ft 




0.0 


JVKJJjcy V^d iNUt>lCdI gldUC 

(OD04339) 


2.0 


Ovarian Cancer 064008 


0.0 


Kidney Margin (OD04339) 


0.0 


Ovarian Cancer 
(OD04768-07) 


1 A A 

14.4 


Kidney Ca, Clear cell type 
(OD04340) 


0.0 


Ovary Margin (OD04768- 
08) 


0.0 


Kidney Margin (OD04340) 


2.4 


Normal Stomach 


0.0 


Kidney Ca, Nuclear grade 3 
(OD04348) 


0.0 


Gastric Cancer 9060358 


0.0 


Kidney Margin (OD04348) 


0.0 


Stomach Margin 9060359 


0.0 


Kidney Cancer (OD04622-01) 


0.0 


Gastric Cancer 9060395 


0.0 


Kidney Margin (OD04622-03) 


0.0 


Stomach Margin 9060394 


0.0 


Kidney Cancer (OD04450-01) 


0.0 


Gastric Cancer 9060397 


0.0 


Kidney Margin (OD04450-03) 


0.0 


Stomach Margin 9060396 


0.0 


Kidney Cancer 8120607 


2.1 


Gastric Cancer 064005 


2.9 



Table 75 . Panel 3D 



Tissue Name 



Rel. Exp.(%) 



Tissue Name 



Rel. Exp.(%) 
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f ► 


Ag2907, Run 
,.164033936 


! 


Ag2907> Run 
, . 164633936 


Daoy- Medulloblastoma 


0.0 

i 


Ca Ski- Cervical epidermoid 
carcinoma (metastasis) , 


49.7 


TE671- Medullo ( blastoma 


0.0 


ES-2- Ovarian clear, cell carcinoma ; 


A A 
U.U 


D283 Med- Medulloblastoma 


0.0 


Ramos- Stimulated witli i 
PMA/ionomycin 6b j 


0.0 


PFSK-1- Primitive 
Neuroectodermal 


15.7 


Ramos- Stimulated with 
PMA/ionomycim 14h ; 


0.0 


XF-498- CNS 


,o.o 1 : 


MEG-01-'£hronic myelogenous 

IpuVptnia ^mppnlcarvoblast i 


o:o , 


oiNrwo- ouoma i 


fi 0 

u.v 


Rnii- "Rnrlrltf'fl Ivmnhnma 

JLvdJl— JJ> LL1 is_l 1 1 o J j lll^JiM.\Jiiu* 


0.0 


SF-268- Glioblastoma 


0.0 


Daudi- Burkitt's lymphoma 


0.0 


T98G- Glioblastoma 


0.0 


U266- B-cell plasmacytoma 


0.0 


SK-N-SH- Neuroblastoma 
(metastasis) 


0.0 


CA46- Burkitt's lymphoma 


0.0 


SF-295- Glioblastoma 


0.0 


RL- npn-Hodgkin's B-cell 
lymphoma 1 


49.7 


Cerebellum 


0.0 


JTJll- pre-B-cell lymphoma 


0.0 


Cerebellum 


iO.O 


Jurkat- T cell leukemia 


0.0 


NCI-H292- Mucoepidermoid 
lung carcinoma ' • ! 


• 1 - ■ j 

A/1 A 


i i 

IJr-l- JLy ry I J IT (j 1 C LIKC IIAJct 


0.0 


DMS-114- Small cell lung 
cancer 


U.U 


nu i /o- i ~ucii lyjLjLjpiiuiiia 


0.0 


DMS-79- Small cell lung 
cancer 


A A 

u.u 


uyj /- iiisnocyxic lympnoma 


0.0 


NCI-H 146- Small cell lung 
cancer 


A A 

U.U 


ivu-oiz- iviyeiogenous leuKcimd 


0.0 


NCI-H526- Small cell lung j 
cancer 


A A 

U.U j 


/ oy^Jr ~ v^iear ceii rcnai uaiotiiuitixi | 


27.2 


NCI-N417- Small cell lung 
cancer 


n A 


L^aKi-z- v^iear ceu reiiai oaiouiuiiia 

. .... . 


0.0 


NCI-H82- Small cell lung 
cancer 


A A 
U.U 


0"\TE 7 QTQ floor /-.oil riatiol rarnnrtTna 
oW 5jy- k^ieal Cell Iclldl Cal OlllUUJa 


0.0 


NCI-H157- Squamous cell 
lung cancer (metastasis) 


a n 
U.U 


r^/lA 1 AA7 i 1 rvi c ' himAt 

VJJ^U 1 Wimio LUIIIOI i 


0.0 


NCI-H 1155- Large cell lung 
cancer 


0.0 


Hs766T- Pancreatic carcinoma (LN , 
metastasis) 


0.0 


NCI-H1299- Large cell lung 
cancer 


16.4 


CAPAN-1- Pancreatic 
adenocarcinoma (liver metastasis) 


0.0 


NCI-H727- Lung carcinoid 


0.0 


SU86.86- Pancreatic carcinoma 
(liver metastasis) 


0.0 


NCI-UMC-ll-Lung 
carcinoid 


0.0 


BxPC-3- Pancreatic 
adenocarcinoma 


0.0 


LX- 1 - Small cell lung cancer 


U.U 


HPAC~ Pancreatic adenocarcinoma 


0 0 


Colo-205- Colon cancer 


0.0 


MIA PaCa-2- Pancreatic carcinoma 


0.0 


KM 12- Colon cancer 


0.0 


CFPAC-1- Pancreatic ductal 
adenocarcinoma 


0.0 


KM20L2- Colon cancer 


0.0 


PANC-1- Pancreatic epithelioid 
ductal carcinoma 


0.0 


NCI-H71 6- Colon cancer 


0.0 


T24- Bladder carcinma (transitional 
cell) 


0.0 


SW-48- Colon 


0.0 


5637- Bladder carcinoma 


0.0 
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adenocarcinoma 








SW1 116- Colon 
adenocarcinoma 


0.0 

! : 1 _ ; . 


HT-1 1 97- Bladder carcinoma 


17.6 

— < — j — — 


LS174T- Colon 
adenocarcinoma 1 


0.0 


yM-UC-3- Bladder carcinma j 
(transitional cell) 


0.0 


SW-948- Colon 
adenocarcinoma 


0.0 


A204- Rhabdomyosarcoma 


0.0 


SW-480- Colon 
adenocarcinoma 


0.0 


HT-1 080- Fibrosarcoma i 


, 0.0 


NCI-SNU-5- Gastric 
carcinoma 


•0.0 


MG-63- Osteosarcoma 


— — " — *~ ; — ■ — 

o.o' ; 


KATO III- Gastric carcinoma 


0.0 


SK-LMS-1- Leiomyosarcoma 
(vulva) 

— — — - — " — — - — ■ — - 


0.0 

; -ss — ; 


NCI-SNU-1 6- Gastric . ' 
carcinoma 


0.0 


SJRH30- Rhabdomyosarcoma (met 
to bone marrow) 


0.0 


NCI-SNU-1- Gastric 
carcinoma 


0.0 


A431- Epidermoid carcinoma 


0.0 ' 


RF-1- Gastric 
adenocarcinoma 


o.o i 


WM266-4- Melanoma j 


0.0 


RF-48- Gastric 
adenocarcinoma 


■ b.b ! 


DU 145- Prostate carcinoma (brain 
metastasis) 1 


0.0 


MKN-45- Gastric carcinoma 


0.0 


MDA-MB-468- Breast ; 
adenocarcinoma 


37.6 


NCI-N87- Gastric carcinoma 


0.0 


SCC-4- Squamous cell carcinoma g 
of tongue 


0.0 


OVCAR-5- Ovarian 
carcinoma 


0.0 


SCC-9- Squamous cell carcinoma 
of tongue 


0.0 


RL95-2- Uterine carcinoma 


0.0 


SCC- 15- Squamous cell carcinoma 
of tongue 


0.0 


HelaS3- Cervical 
adenocarcinoma 


0.0 


CAL 27- Squamous cell carcinoma 
of tongue 


100.0 



Table 76 . Panel 4D 



Tissue Name 


Rel. Exp.(%) Ag2907, 
Run 157284733 


Tissue Name ! 


Rel. Exp.(%) Ag2907, 
Run 157284733 


Secondary Thl act 


0.0 


HUVEC IL-lbeta \ 


0.0 


Secondary Th2 act 


0.0 


HUVEC IFN gamma 


0.0 


Secondary Trl act 


0.0 


HUVEC TNF alpha + IFN 1 
gamma 


0.0 


Secondary Thl rest 


0,0 


HUVEC TNF alpha + IL4 


0.0 


Secondary Th2 rest 


0.0 


HUVEC IL-11 


0.0 


Secondary Trl rest 


0.0 


Lung Microvascular EC none 


0.0 


Primary Thl act 


12.3 


Lung Microvascular EC 
TNFalpha + IL-1 beta 


10.4 


Primary Th2 act 


0.0 


Microvascular Dermal EC none \ 


11.2 


Primary Trl act 


0.0 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


0.0 


Primary Thl rest 


11.7 


Bronchial epithelium TNFalpha ' 
+ ILlbeta 


12.9 


Primary Th2 rest 


0.0 


Small airway epithelium none 


24.0 


Primary Trl rest 


0.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


100.0 


CD45RA CD4 lymphocyte 


0.0 


Coronery artery SMC rest 


0.0 
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act 








CD45RO CD4 lymphocyte 
act 


6.1 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


0.0 


CD8 lymphocyte act 


0.0 


Astrocyte's rest 


0.0 


lymphocyte rest 


0.0 


Astrocytes TNFalpha + IL-lbeta 


12.0 


Secondary CD8 
lymphocyte act 


0.0 


KU-812 (Basophil) rest 


0.0 


CD4 lymphocyte none 


4.5 


KU-8 12 (Basophil) 
PMA/ionomycin 


0.0 


2ryThl/Th2/Trl anti- 
CD95CH11 


0.0 


CCD 1 106 (Keratinocytes) none 


0.0 


LAK cells rest 


u.u 


CCD 1 106 (Keratinocytes). 
TNFalpha + IL-lbeta 


0.0 


LAK cells IL-2 


13.4 


Liver ciiThosis v 


35.1 


LAK cells IL-2+IL- 12 


0.0 


Lupus kidney 


0.0 


LAK cells IL-2+IFN 
gamma 


0.0 


NCI-H292 none 


33.0 


LAK cells IL-2+ IL-18 


0.0 


NCI-H292 IL-4 


36.6 


LAK cells 
PMA/ionomycin 


0.0 


NCI-H292 IL-9 


23.3 


NK Cells IL-2 rest 


0.0 


NCI-H292 IL-13 


21.2 


Two Way MLR 3 day 


0.0 


NCI-H292 IFN gamma 


11.6 


Two Way MLR 5 day 


0.0 ; 


HPAEC none 


13.3 


Two Way MLR 7 day 


0.0 


HPAEC TNF alpha + IL-l beta j 


0.0 


PBMC rest 


0.0 


Lung fibroblast none 


0.0 


PBMC PWM 


11./ : 


Lung fibroblast TNF alpha + IL-i 
1 beta 


0 0 
u.u 


PBMC PHA-L 


0.0 


Lung fibroblast IL-4 


0.0 


Ramos (B cell) none 


0.0 


Lung fibroblast IL-9 


0.0 


Ramos (B cell) ionomycin 


0.0 


Lung fibroblast IL-13 


0.0 


B lymphocytes PWM 


o.o : 


Lung fibroblast IFN gamma \ 


0.0 


B lymphocytes CD40L 
and IL-4 


n o 
u.u 


i^ermai iiorooiasi ^jl/iu/u rcbii 


0.0 


EOL-1 dbcAMP 


0.0 < 


Dermal fibroblast CCD1070 
TNF alpha 


0.0 


EOL-1 dbcAMP 

Y lYLrV/lOIlUIIiyOlll 


0.0 


Dermal fibroblast CCD 1070 IL- , 
1 beta 


0.0 


Dendritic cells none 


0.0 


Dermal fibroblast IFN gamma 


0.0 


Dendritic cells LPS 


0.0 


Dermal fibroblast IL-4 j 


0.0 


Dendritic cells anti-CD40 


0.0 


IBD Colitis 2 


13.1 


Monocytes rest 


0.0 


IBD Crohn's 


6.9 


Monocytes LPS 


26.1 


Colon 


0.0 


Macrophages rest 


0.0 


Lung ; 


0.0 


Macrophages LPS 


0.0 


Thymus 


16.0 


HUVEC none 


0.0 


Kidney 


13.1 


HUVEC starved 


0.0 







Panel 1.3D Summary: Ag2907 Results from two experiments with the same 



probe/primer set are in good agreement. Expression of the CG56097-01 gene is highest in a 

sample derived from pituitary tissue (CTs = 31-33) with little to no expression detected in any 
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other tissue. Thus, expression of this gene could be used to distinguish pituitary gland from the 

other samples on this panel. 

The protein encoded for by this gene is most homologous to a glucuronosyltransferase 

that is normally found in liver. UDP glycosyltransferases (UGT) are a superfamily of enzymes 
5 that catalyze the addition of the glycosyl group from a UTP-sugar to a small hydrophobic 

molecule. Glucuronosyltransferases are membrane-bound microsomal enzymes that catalyze 

the transfer of glucuronic acid to a wide variety of exogenous and endogenous lipophilic 

substrates. These enzymes are of major importance in the detoxification and subsequent 

elimination of xenobiotics such as drugs and carcinogens. The pituitary plays a major role in 
1 0 the' physiology of many different systems in the body. Therefore, the CG5 6097-01 gene may 

play an essential role in maintaining proper function of the pituitary gland and many of its 

' secreted peptides. Furthermore, therapeutic modulation of the activity of this gene or its 

protein product using small molecule drugs may be useful for the treatment of diabetes and 

obesity as well as growth, reproductive, and endocrine disorders. 
1 5 Panel 2.2 Summary: Ag2907 Expression of the CG56097-01 gene is 

low/undetectable (CTs >35) across all of the samples on this panel (data not shown). 

Panel 2D Summary: Ag2907 Expression of the CG56097-01 gene is highest and 

almost exclusive to a sample derived from a prostate cancer (CT - 31.7). Thus, the expression 

of this gene could be used to distinguish prostate cancer from the other samples in the panel. 
20 Moreover, therapeutic modulation of the activity of this gene or its protein product, through 

the use of small molecule drugs, protein therapeutics or antibodies, might be of benefit in the 

treatment of prostate cancer. 

Panel 3D Summary: Ag29Q7 Expression of the CG56097-01 gene is highest in a 

sample derived from a squamous cell carcinoma cell line (CT = 33.8). Thus, the expression of 
25 this gene could be used to distinguish this sample from the other samples in the panel. 

Panel 4D Summary: Ag2907 Expression of the CG56097-01 gene is detected at a 

very low level in small airway epithelium treated with the inflammatory cytokines TNP-a and 

IL-lb (CT = 34.2). Thus, expression of this gene may be a marker of loss of homeostasis in 

this cell type. 
30 J. NOV14b: PROSTASIN PRECURSOR 

Expression of theNOV14b gene (CG56123-02) was assessed using the primer-probe 

set Ag3360, described in Table 77. 

Table 77 . Probe Name Ag3360 

Primers] ~ Sequences 
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1 Position 


Forward 


5 ' -gtacactctggcctccagctat-3 ' (-SEQ ID NO:238) 


22. } 1017 


Probe 


TET-5 1 -ctcctggatccaaagcaaggactctg-3 ' -TAMRA (SEQ 
ID NO: 23 9)' • 


2§ | , 1041 


Reverse 


5 1 -gaatgggctcaaagatcaagat ~ 3 • ' '{SEQ ID NO:240) 


22 j 1084 



10 



15 



CNSneurodegenerationvl.O Summary: A g3 3 60 Expression of the CG56123-02 
gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

General_screening_panel_vl.4 Summary: Ag3360 Expression of the CG561 23-02 
gene is low/undetectable (CTs > 55) across all of the samples on this panel (data not shown). 

Panel 4D Summary: Ag3360 Expression of the CG56123-02 gene is 
low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 
K, NOV15: LDLB-Iike 

Expression of the NOV15 gene (CG50153-Q1) was assessed using the primer-probe 
sets Ag2452 and Ag2002, described in Tables 78-79. Results of the RTQ-PCR runs are shown 
in Tables 80-83. Please note that this gene was previously incorrectly called LDL receptor- 
like. However, this gene is most homologous to the mouse LDLB gene, encoding a cytosolic 
protein that is essential for normal Golgi function (Chatterton JE, Hirsch D, Schwartz JJ, 
Bickel PE, Rosenberg RD, Lodish HF, Krieger M. Expression cloning of LDLB, a gene 
essential for normal Golgi function and assembly of the IdlCp complex. Proc Natl Acad Sci U 
SA 1999 Feb 2;96(3):9 15-20). 
Table 78 . Probe Name Ag2452 



Primers 


Sequences 


Length 


1 Start 
Position 


Forward 


5 » -agcagtgcagttgtgaaagttt-3 ■ (SEQ ID NO:241) 


22 


| 2053 


Probe 


TET-5 • -tgattcatggattcacccagtcatta-3 » -TAMRA (SEQ 
ID NO:242) 


26 


2075 


Reverse 


5 * -cagaactgagccagcatcat-3 1 (SEQ ID NO: 243) 


20 | 2108 


Table 79. Probe Name Ag2002 


Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 • -gccagaaaggcaactattcag-3 1 (SEQ ID NO: 244) 


21 


697 


Probe 


TET-5 ' -aacttctcaaccagccacaccatggt-3 ' -TAMRA (SEQ 
ID NO: 24 5) 


26 


719 


Reverse 


5 1 -agcaactccactaatgagcaaa-3 ' (SEQ ID NO:246) 


22 ; 


764 


Table 80. CNS neurodegeneration vl.O 



Tissue Name 


Rel. Exp.(%) Ag2452, Run 
206266095 


Tissue Name 


Reh Exp.(%) Ag2452, Run 
206266095 


AD 1 Hippo 


8.0 


Control (Path) 3 
Temporal Ctx 


4.2 


AD 2 Hippo 


36.3 


Control (Path) 4 
Temporal Ctx 


35.6 
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AD 3 Hippo 


2.9 


AD 1 Occipital Ctx 


5.2 


AD 4 Hippo 


5.-3 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 hippo 


52.1 


AD 3 Occipital Ctx 


. , , 2j 


AD 6 Hippo 


47.0 


AD 4 OccipitaJ Ctx 


27.9 


Control 2 Hippo 


37.4 


At) 5 Occipital Ctx 


15.7 


Control 4 Hippo 


7.7 


AD 6 Occipital Ctx 


49.3 


Control (Path) 3 Hippo 


i.o 


Lonnoi l wooipiidi v^ia 


2 1 


AD 1 Temporal Ctx 


7.9 


Control' 2 Occipital Ctx 


65.1 


AD 2 Temporal Ctx 


49.0 


Control 3 Occipital Ctx 


10.0 


AD 3 Temporal Ctx 


4.1 


Control 4 Occipital Ctx 


5.2 


AD 4 Temporal dtx 


28.3 


Control (Path) 1 
uccipnai ^rx 


87.1 


AD 5 Inf Temporal Ctx 


76.3 


Control (Path) 2 
occipnai utx 


9.0 


AD 5 SupTemporal Ctx 


32.8 


Control (Path) 3 
uccipjiai v_>rx • 


1.5 


AD 6 Inf Temporal Ctx 


46,7 

i * 


'Control (Path) 4 
Occipital utx 


11.9 


AD 6 Sup Temporal Ctx j 


42.0 


Control 1 Parietal CtX 


, 5.0 


Control 1 Temporal Ctx 


3.7 


Control 2 Parietal Ctx 




Control 2 Temporal Ctx 


51.1 


Control 3 Parietal Ctx 


15.5 


Control 3 Temporal Ctx 


14.0 


Control (Path) 1 
Parietal Ctx 


87.1 


Control 4 Temporal Ctx 


7.5 


Control (Path) 2 
Parietal Ctx 


24.7 


Control (Path) 1 
Temporal Ctx 


100.0 


Control (Path) 3 
Parietal Ctx 


1.3 


Control (Path) 2 
Temporal Ctx 


45.1 


Control (Path) 4 
Parietal Ctx 


41.8 



Table 81 . Panel 1.3D 



Tissue Name 


Rel. Exp.(%) 
Ag2002, Run 
147805868 


ReLExp.(%) 
Ag2452,Run 
155896645 


Tissue Name 


Rel. Exp.(%) 
Ag2002, Run 
147805868 


Rel. Exp.(%) 
Ag2452, Run 
155896645 


Liver 

adenocarcinoma 


15.9 


6.1 ; 


Kidney (fetal) 


9.7 


5.9 


Pancreas 


5.0 


3.1 


Renal ca. 786-0 , 


6.8 


2.7 


Pancreatic ca. 
CAP AN 2 


3.8 


1.7 


Renal ca. A498 j 


34.9 


14.8 


Adrenal gland 


12.7 


7.7 


Renal ca. RXF 
393 


6.9 


1.3 


Thyroid 1 


13.6 


6.7 


Renal ca. ACHN 


24.5 


1.4 


Salivary gland 


7.3 


4.9 


Renal ca.UO-31 


15.5 


3.7 


Pituitary gland 


23.8 


24.8 


Renal ca.TK-10 


14.9 


4.6 


Brain (fetal) 


8.5 


8.9 


Liver 


2.8 


2.9 


Brain (whole) 


33.9 


18.9 


Liver (fetal) 


7.9 


7.1 


Brain (amygdala) 


19.6 


28.9 


Liver ca. 
(hepatoblast) 
HepG2 i 


28.1 


5.8 


Brain (cerebellum) 


8.5 


9.1 


Lung 


7.5 


11.7 


Brain (hippocampus) 


48.6 


100.0 


Lung (fetal) 


14.6 


7.6 
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Brain (substantia 
nigra) 


5.3 


4.3 


Lung ca. (small 
cell) LX-1 


16.6 


3.1 


Brain (thalamus) 


15.4 


}3A , 


Lung ca. (small 
cell)NCr-H69 > 


36.1 


14.7 . 


Cerebral Cortex 


100.0 


41.2 


Lung ca. (s.cell 
var.)SHP-77 ' 


30.6 


15.6 


Spinal cord 


8.5 


5.3 


Lung ca. (large 
cell)NCI-H460 


4.5 


2.2 


glio/astro U87-MG 


14.4 ' 


5.3 ' ! 


Lung ca. (non- 
sm. cell)A549 


12.0 


.8.2 


glio/astro U-118- 
MG 


39.5 


20.0 


Lung ca. (non- 
s.cell) NCI-H23 


15.4 


3.8 




astrocytoma 
SW1783 


25.5 » j 


10.3 


Lung ca. (non- 
s.cell) HOF-62 ■ 


21.8 


5.1 


neuro*; met SK-N- 
AS 


36.9 


34.6 


Lung ca. (non- 
s.cl) NCI-H522 ! 


18.3 


- " - r 

5.5 


astrocytoma SF-539 


12.0 


3.8 \ 


Lung ca. 

(squam.) SW 900; 


9.8 


4.0 


astrocytoma SNB-75 


33.7 


6.7 


Lung ca. 
(squam.) NCI- 
H596 


14.7 


3.1 


glioma SNB-19 


16.0 


3.9 


Mammary gland 


27.5 


11.2 


glioma U251 


0.0 : 

.... 


3.9 


Breast ca.* 
(pl.ef) MCV-i 


23.7 


7.3 


glioma SF-295 


28.9 


8.2 


Breast ca.* = 
(pl.ef) MDA~ 
A/TFU931 


39.8 


23.7 


Heart (fetal) 


55.1 


9.6 


Breast ca.* j 
frvl eft T47D 


37.1 


8.4 


Heart 


7.2 


2.7 


Breast ca. BT- 


16.4 


11.0 


Skeletal muscle 
(tetal) 


84.i : 


24.8 


Breast ca. MDA- 

IN 


20.6 


8.7 


Skeletal muscle 


8.4 


3.8 


Ovary 1 


52.5 


17.6 


Bone marrow 


3.1 


4.5 


Ovarian ca. 
OVCAR-3 


19.9 


4.9 


Thymus 


7.5 


3.3 


Ovarian ca. 
OVCAR-4 


3.3 


0.9 


Spleen 


12.7 


9.0 


Ovarian ca. 
OVCAR-5 


32.5 


7.0 


Lymph node 


12.9 


4.1 


Ovarian ca. 
OVCAR-8 


14.4 


5.4 


Colorectal 


18.7 


5.9 


Ovarian ca. 
IGROV-1 


3.8 


1.9 


Stomach 


17.7 


5.2 


Ovarian ca.* 
(ascites) SK-OV- 
3 


12.0 


4.4 


Small intestine 


10.4 


10.4 


Uterus 


14.2 


7.6 


Colon ca. SW480 


34.2 


7.8 


Placenta 


13.2 


7.9 


Colon ca * 
SW620(SW480 met) 


17.1 


6.0 


Prostate 


6.8 


6.0 


Colon ca. HT29 


10.2 


3.8 


Prostate ca.* 
(bone met)PC-3 


18.4 


8.1 
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Colon ca.HCT-'l 16 


9.2 


5.8 


Testis 


19.6 


10.6 


Colon ra PaPo-2 


ii.2 


5.4 


Melanoma 
Hs688(A).T 


28.9 


3.7 


Colon C3. 
tissue(OD03S66) 


"'i 

15.7 


5.3 ' ' 


Melanoma* 
(met) 


" ■■; 'i' 

45.7 


■ 2.3 


Colon ca. HCC-2998 


14.9 


10.7 


Melanoma 
UACC-62 


3.3 


1.1 


Gastric ca.* (liver 
met) NCI-N87 


33.6 


10.7 

i 


Melanoma M 14 

— - — i — » 


3.5 


1.2 


Bladder 


i 

5.3 ; 


3.8 


Melanoma LOX 
IMVI 


6.7 


9.1 ' 


Trachea 1 


14.0 


33.2 


Melanoma* 
(met) SK-MEL-5 


13.7 


12.9 


Kidney 


3.3 


2.3 


Adipose 


4.6 


2.6 



Table 82 . Panel 2D 



Tissue Name 


Rel. Exp.(%) Ag2452, 
Run 155897997 ■ 




Rel. Exp.(%) Ag2452, 
Run 155897997 


Normal Colon 




Kidney Margin 8120608 


24.5 


CC Well to Mod Diff 
(OD03866) 


18.2 


Kidney Cancer 81^0613 


' 51.1 


CC Margin (OD03866) 


19.1 


Kidney Margin 8120614 


29.1 


CC Gr.2 rectosigmoid 
(OD03868) 


8.0 


Kidney Cancer 9010320 


19.3 


CC Margin (OD03868) 


6.0 


Kidney Margin 9010321 


31.4 


CC Mod Diff (ODO3920) 


23.3 


Normal Uterus 


6.8 


CC Margin (ODO3920) 


24.0 


Uterus Cancer 064011 


32.1 


CC Gr.2 ascend colon 
(OD03921) 


91.4 


Normal Thyroid 


29.9 


CC Margin (OD03921) 


19.8 


Thyroid Cancer 064010 


28.3 


CC from Partial Hepatectomy 1 
(ODO4309)Mets 


66.4 


Thyroid Cancer A302152 


17.1 


Liver Margin (ODO4309) 


21.2' 


Thyroid Margin A302153 


29.9 


Colon mets to lung (OD04451- 
01) 


24.3 


Normal Breast 


25.7 


Lung Margin (OD04451-02) 


14.7 


Breast Cancer (OD04566) 


15.3 


Normal Prostate 6546-1 


33.9 


Breast Cancer (OD04590- 
03) 


76.8 


Prostate Cancer (OD04410) 


38.7 


Breast Cancer Mets 
(OD04590-03) 


68.3 


Prostate Margin (OD04410) 


35.8 


Breast Cancer Metastasis 
(OD04655-05) 


77.9 


Prostate Cancer (OD04720-01) 


52.5 


Breast Cancer 064006 


14.2 


Prostate Margin (OD04720-02) 


68.3 


Breast Cancer 1024 


24.3 


Normal Lung 061010 


35.8 


Breast Cancer 9100266 


68.3 


Lung Met to Muscle 
(OD04286) 


28.3 


Breast Margin 9100265 


31.6 


Muscle Margin (OD04286) 


17.8 


Breast Cancer A209073 


35.4 


Lung Malignant Cancer 
(OD03126) 


35.6 


Breast Margin A2090734 


22.8 


Lung Margin (OD03126) 


45.1 


Normal Liver 


9.6 
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Lung Cancer (OD04404) 


17.9 


Liver Cancer 064003 


9.3 


Lung Margin (OD04404) 


18.0 


Liver' Cancer 1025 


9.7 


Lung Cancer (OD04565) 


63 


Liver Cancer 1026 


9.6 


Lung Margin (OD04565) 


8.2 


Liver' Cancer 60t)4-T 


13.6 


Lung Cancer (OD04237-01) 


50.0 


Liver Tissue 6004-N 


18.8 


Lung Margin (OD04237-02) 


16.8 


Liver Cancer 6005-T 


10.3 


L/vUiclI ivlCI iviCL lu .U1VCI 

(ODO4310) 


19.9 


Liver Tissue 6005-N 


1.7 


Liver Margin (ODO4310) 


18.6 


Normal Bladder 


53.2 


lVJ.CIaIlUIila IvlClo IU _L>IU1£ 

(OD04321) 


35.1 


Bladder Cancer 1023 


37.1 


i_,ung jviargin \^ju\jh jzi j 


35.1 


Bladder Cancer A302173 


26.6 


Normal Kidney 


• - i 1 — • '•- 

57.4 


Bladder Cancer 
(OD047 18-01) 


46.3 ( 


Kidney Ca, Nuclear grade 2 
(OD04338) 


58.2 


Bladder Normal Adjacent : 
(OD047 18-03) 


24.8 


jvianey iviargin [\JLs\j £ i3jc>) 


30.1 


Nnrnial Ovarv 


41.8 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


26.4 


Ovarian Cancer 064008 


54.0 


Kidney Margin (OD04339) 


35.4 


Ovarian Cancer ' 
(OD04768-07) 


•sc. o 

/o.o 


Kidney Ca, Clear cell type 
(OD04340) 


7 

JO. / 


Ovary Margin (OD04768- 
08) 


10.5 


Kidney Margin (OD04340) 


28.7 


Normal Stomach 


33.0 


Kidney Ca, Nuclear grade 3 
(OD04348) 


18.3 


Gastric Cancer 9060358 


11.5 


Kidney Margin (OD04348) 


25.7 


Stomach Margin 9060359 


28.5 


Kidney Cancer (OD04622-01) 


18.4 


Gastric Cancer 9060395 


35.1 


Kidney Margin (OD04622-03) 


7.0 


Stomach Margin 9060394 


40.3 


Kidney Cancer (OD04450-01) 


25.7 


Gastric Cancer 9060397 


71.7 


Kidney Margin (OD04450-03) 


24.1 


Stomach Margin 9060396 


18.2 


Kidney Cancer 8120607 


13.2 


Gastric Cancer 064005 


35.1 



Table 83 . Panel 4D 



Tissue Name 


Rel.Exp.(%) 
Ag2002,Run 
165826206 


Rel.Exp.(%) ; 
Ag2452, Run 
155898869 


Tissue Name 


Rel.Exp.(%) 
Ag2002, Run 
165826206 


Rel. Exp.(%) 
Ag2452, Run 
155898869 


Secondary Thl act 


25.3 


27.4 


HUVEC IL-lbeta 


15.8 


24.7 


Secondary Th2 act j 


35.8 


16.7 


HUVEC IFN gamma 


37.6 


45.4 


Secondary Trl act 


36.9 


46.3 


HUVEC TNF alpha + j 
EFN gamma 


29.3 


27.4 


Secondary Thl rest 


25.9 


13.4 


HUVEC TNF alpha + 
1L4 


27.5 


20.4 


Secondary Th2 rest 


18.9 


17.8 


HUVEC IL-11 


13.9 


11.7 


Secondary Trl rest 


23.0 


17.0 


Lung Microvascular 
EC none 


21.2 


26.4 


Primary Thl act 


15.9 


31.0 


Lung Microvascular 
EC TNFalpha + IL- 
lbeta 


29.1 


34.9 


Primary Th2 act 


31.0 


26.8 


Microvascular 
Dermal EC none 


25.5 


36.3 


Primary Trl act 


30.4 


35.8 


Microsvasular Dermal 


25.7 


28.7 
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EC TNFalpha + IL- 
Ibeta ' 




. > 


jrijjiidry nil rcbi 


/ O.o 




Bronchial epithelium 
TNFalpha + ILlbeta * 


23.0 


U.J 


Primary Th2 rest 


33.9 


39.2 


Small airway 
epithelium none 


18.8 


17.7 


Primary Trl rest 


29.3 


23.2 


Small airway 
epithelium TNFalpha 
+ IL-1 beta 


44.4 


51.1 


CD45RA CD4 
lymphocyte act 


26.1 


30.1 


Coronery artery S/VIC 
rest ' 


27.9 


45.4 


CD45RO CD4 
lymphocyte act 


37.1 


34.9 : 


Coronery artery SMC 
TNFalpha + IL-1 beta ; 


22.4 


25.0 


CD8 lymphocyte act • 


21.9 


16.8 


Astrocytes rest 


40.9 


24.0 

: , 1 


Secondary CD 8 
lymphocyte rest 


26.8 


24.3 


Astrocytes TNFalpha 
+ IL-1 beta 


58.6 


17.4 


Secondary CD8 ; 
lymphocyte act 


15.1 




KU-8 12 (Basophil) 
rest 


Zo.D 


-51 n 
j I .u 


CD4 lymphocyte 
none 


23.0 


19.6 ; 


KU-8 12 (Basophil) 
PMA/ionomycin 


o3. / 


1 


2ry 

Thl/Th2/Trl anti- i 
CD95CH11 ! 


29.3 


22.5 


CCD! lUo 

(Keratinocytes) none 


17.1 


18.9 


LAK cells rest 


17.0 


21.5 


CCD 1106 
(Keratinocytes) 
TNFalpha + IL- Ibeta | 


80.1 


2.6 


LAK cells IL-2 


33,2 


22.8 


Liver cirrhosis 


16.7 


3.3 


LAK cells IL-2+IL- 
12 


33.2 


18.4 


Lupus kidney 


21.9 


5.1 


LAK cells IL-2+IFN 
gamma 


35.1 


37.9 


NCI-H292 none 


24.0 


46.3 


LAK cells IL-2+ IL- 
18 


30.1 


35.6 


NCI-H292 IL-4 


24.3 


42.6 


LAK cells 
PMA/ionomycin 


5,7 


6.0 


NCI-H292 IL-9 


25.2 


58.6 


NK Cells IL-2 rest 


24.1 


19.3 


NCI-H292 IL-13 


11.7 


32.3 


Two Way MLR 3 
day 


29.3 


28.9 


NCI-H292 IFN 
gamma 


14.8 


37.9 


Two Way MLR 5 
day 


22.4 


15.3 


HPAEC none 


23.2 


25.7 


Two Way MLR 7 
day 


21.6 


12.1 


HPAEC TNF alpha + 
IL-1 beta 


39.2 


44.4 


PBMC rest 


14.7 


12.4 


Lung fibroblast none 


40.6 


26.2 


PBMC PWM 


22.4 


57.8 


Lung fibroblast TNF 1 
alpha + IL-1 beta 


71.2 


31.6 


PBMC PHA-L 


11.5 


28.7 


Lung fibroblast IL-4 


52.9 


66.4 


Ramos (B cell) none 


27.2 


21.5 


Lung fibroblast IL-9 ; 


29.9 


67.8 


Ramos (B cell) 
ionomycm 


16.8 


66.9 JLung fibroblast IL-13 


33.0 


35.1 


B lymphocytes PWM 


23.7 


65 - jLung fibroblast IFN 
j gamma 


45.1 


77.4 


B lymphocytes 
CD40L and IL-4 


23.7 


~ 7 _ (Dermal fibroblast 
]CCD1 070 rest 


563 


83.5 
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EOL-1 dbcAMP 


8.8 


6.0 


Dermal fibroblast 
CCD 1070 TNF alriha 


84.7 ( 


100.0 


EOL-1 dbcAMP 
PMA/ionomycin 


. 8.3 


6.6 


Dprmal fihrnMact 

CCD1070IL-Ibeta 


39.8 P 


45.4 


Dendritic cells none 


14.0 


10.1 


Dermal fibroblast 1FN 
gamma 


15.5 


19.5 


Dendritic cells LPS 


14.0 


10.4 


Dermal fibroblast IL- 
4 


29.3 


42.3 


Dendritic cells anti- 
CD40 


19.5 


15.0 


IBD Cojitis 2 


4.8 


2.2 


Monocytes rest 


25.3 


22.5 


IBD Crohn's 


7.3 


4.7 • 


Monocytes LPS 


25.3 


20.0 


Colon 


100.0 


37.9 


Macrophages rest 


21.6 


24.8 


Lung 


15.1 


26.6 


Macrophages LPS 


• 1'6.5 


13.7 


Thymus 


40.1 


55.1 


HUVEC none 


31.2 


36.9 


Kidney 1 35.4 


67.8 


HUVEC starved 


45.4 


55.9 


1 



CNS_neurodegeneration_vl.O Summary: Ag2452 The CQ501 53-01 gene is 
expressed in most of the samples in this pane] with highest expression detected in the temporal 
cortex of a control patient (CT=29.4). This panel confirms the expression of the CG56071-01 
gene in the CNS in an independent group of patients. However, np differential expression was 



5 found between Alzheimer's disease and control postmortem brains in this experiment. Please 
see Panel 1.3D for a discussion of the potential utility of this gene in the central nervous 
system. 

Panel 1.3D Summary: A£2002/Ag2452 Two experiments with two different 
probe/primer sets gave results that are in very good agreement, with highest expression in both 

10 runs occurring in regions of the brain. Expression of the CG501 53-01 gene is highest in the 
cerebral cortex (CTs=26) in one run and the hippocampus in the other (CT=27), with 
significant expression also detected in the amygdala. This expression pattern indicates a 
potential role for the CG501 53-01 gene product in Alzheimer's disease (AD), since this gene is 
expressed in the regions of the brain important to AD pathology. Therefore, the CG50153-01 

15 gene product may be a promising antibody or small molecule target for the treatment of 
Alzheimer's disease. 

High levels of expression of this gene are also detected in cell lines derived from brain 
cancer, breast cancer, lung cancer, kidney cancer and melanoma. In addition, the expression in 
normal ovary seems to be higher than in cell lines derived from ovarian cancer tissues. Thus, 
20 the expression of this gene could be of use as a marker or as a therapeutic for these cancers. 

The CG501 53-01 gene is widely expressed in tissues with metabolic and endocrine 
function, including adrenal gland, pituitary gland, thyroid, pancreas, adipose, liver, skeletal 
muscle and heart. Therefore, therapeutic modulation of the activity of this gene or its protein 
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product using protein therapeutics, antibodies or small molecule drugs could be of benefit in 
the treatment of metabolic diseases such as obesity and diabetes, cardiovascular diseases and 
endocrine disorders. Significantly, this gene is expressed at higher levels in 'fetal skeletal 
muscle (CTs=27-30) than in adult skeletal muscle (CTs=30-33). This difference in expression 
5 suggests that the CG501 53-01 protein product could be involved in muscular growth or 

development in the fetus and therefore could act in a regenerative capacity in an adult. Thus, 
therapeutic modulation of the CO501 53-01* gene could be useful in the treatment of muscle 
related diseases and treatment with the protein product could restore muscle mass or function 
to weak or dystrophic muscle. 

10 Panel 2D Summary: Ag2452 Highest expression of the CG501 53-01 gene occurs in 

colon (CT=29.7). High levels of expression are also detectable in breast cancer, prostate 
cancer, ovarian cancer, and colon cancer when compared to their normal adjacent tissue. Thus, 
expression of the CG50153-01 gene could be used as a marker to detect the presence of these 
cancers. Moreover, therapeutic modulation of the activity of this gene or its protein product, 

1 5 using protein therapeutics, monoclonal antibodies, or small molecule drugs, could be of 
benefit in the treatment of breast, prostate, ovarian and colon cancer. 

Panel 4D Summary: Ag2002/Ag2452 Two experiments with two different 
probe/primer sets show highest expression of the CG501 53-01 gene in normal colon 
(Cl>26,2) and dermal fibroblasts treated with TNF-alpha (CT=29.2). High expression of this 

20 gene in colon confirms the result obtained in Panel 2D. Significant expression is also seen in 
fibroblasts, endothelial and epithelial cells, keratin ocytes, leukocytes, smooth muscle cells and 
normal kidney. The CG501 53-01 gene is expressed at much lower levels in colon from a 
patient with inflammatory bowel disease (IBD) when compared to expression in normal colon. 
Similarly, expression in lupus kidney is much lower than normal kidney. Thus, the protein 

25 encoded by the CG501 53-01 gene may be involved in normal tissue/cellular functions and at 
least in the kidney and colon and downregulation of this protein may serve as a diagnostic 
marker for lupus or IBD. 

L. NO VI 6a and NO VI 6b: TRAF5-like 

30 Expression of the NOV16a gene (CG56108-01) and the NOV16b gene (CG68108-02) 

was assessed using the primer-probe sets Ag3028 and Ag5257, described in Tables 84-85. 
Please note that Ag3028 recognizes both CG56108-01 and CG68108-02 variants, whereas 
Ag5257 specifically recognizes the CG681 08-02 variant Results of the RTQ-PCR runs are 
shown in Tables 86-90. In addition, please note that the CG561 08-01 gene was previously 
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incorrectly called TNF receptor-like; however, this gene is almost identical to the TRAF5 
gene. 

Table 84 . Probe Name Ag3028 " 



Primers 


Sequences 


Length 


Start 
Position 


Forward 


5 ■ -aaaggacagtcttgctcagctt-3 ' (SEQ ID NO:247) 


22 


656 


Probe 


TET-5 1 -caggacatacagccaggtgttcatc.t -3 1 -TAMRA (SEQ 
ID NO:248) 


26 


■63 0 


Reverse 


5 * -ccaacaattgtgcgaagattat-3 1 (SEQ 'ID NO: 24 9) 


22 | 593 


Table 85. Probe Name Ag5257 


Primers 

~* — z* — ■■ ■ 


Sequences 


Length 


Start 
Position 


Forward 


5 1 -ggtagatgaacacctggctgtat-3 1 (SEQ ID NO: 250) 


23 


627 


Probe 


TET-5 ' -cctgaagctgagcaagactgtccttttaagcac-3 ' -TAMRA 
(SEQ ID NO: 251) 


33 


.652 


Reverse 


5 1 -taagtcagaaatcgttacagcaca~3 1 (SEQ ID NO:252) 


24 


691 


Table 86. AI comprehensive panel vl.O 



Tissue Name 


Rpl Fvn (°/«\ 
Ag3028, Run 
229313563 


Ag5257, Run 
229930786 


Tissue Name 

i 


T?^1 T? Yn (OS \ 
XVCI» ILAJJ.^ /to) 

Ag3028, Run 
229313563 


RpI Fvn (%\ 
Ag5257, Run 
229930786 


110967 COPD-F 


31.6 


21.2 


112427 Match 
Control Psoriasis- 
F 


42.9 


59.5 


110980 COPD-F 


. 23.2 


20.3 


112418 Psoriasis- 
M 


23.2 


34.6 


110968 COPD-M 


45.7 


30.8 


1 12723 Match 
Control Psoriasis- 
M 


5.8 


6.0 


110977 COPD-M 


41.8 


60.3 


112419 Psoriasis- 
M 


42.9 


36.9 


110989 

Empbysema-F 


62.4 


65.1 


112424 Match 
Control Psoriasis- 
M 


29.3 


16.5 


110992 

Emphysema-F 


25.0 


17.3 


112420 Psoriasis- 
M 


100.0 


73.2 


110993 

Emphysema-F 


40.3 


37.4 


112425 Match 
Control Psoriasis- 
M 


36.1 


33.4 


110994 

Emphysema-F 


17.8 


15.2 


104689 (MF) OA 
Bone-Backus 


21.2 


15.2 


110995 , 
Emphysema-F 


38.4 


36.9 


104690 (MF) Adj 
"Normal" Bone- 
Backus 


12.2 


9.0 


110996 

Emphysema-F 


6.3 


9.8 


104691 (MF) OA 
Synovium-Backus 


30.1 


31.4 


110997 Asthma- 
M 


1.3 


2.0 


104692 (BA) OA 
Cartilage-Backus 


2.3 


2.6 


111001 Asthma-F 


35.6 


30.6 


104694 (BA) OA 
Bone-Backus 


19.9 


16.4 


111002 Asthma-F 


46.0 


38.4 


104695 (BA) Adj 


9.5 


14.7 
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"Normal" Bone- 
Backus ■ 






111003 Atopic 
Asthma-F 


37.4 


37,6 


104696 (BA) OA 
Synovium-Backus ■ 


30,1 


33.2 


111004 Atopic 
Asthrna-F 


39.0 


37.9 


104700 (SS)OA 
Bone-Backvis 


9.0 


14.3 


11100S Atonic 
Asthrna-F 


28.1 


33.4 


104701 (SS) Adj 
"Normal" Bone- 
Backus 


13.2 


11.1 


111006 Atopic 
Asthma-F 


7.9 


8.3 


104702 (SS) OA 
Syndvium-Backus' 


33.2 


21.2 


111417 Allergy- 
M 


20.0 


30.1 


117093 OA 
Cartilage Rep7 


46.3 


39.5 


112347 AHergy- 
M 


4.0 


1.6 


112672 OA 
Bone5 


33.0 


31.0 

i 


112349 Normal 
Lung-F 


9 0 


1.6 


112673 OA 
Synoviums 


17.3 


14.6 


1 1 9^S7 "Nrvrmal 

Lung-F 


13.9 


18.4 


112674 OA 
Synovial Fluid 
cells5 


17.8 


15.5 


112354 Normal 
Lung-M 


7 1 


lJ.O 


117100 OA 
Cartilage Repl4 


7.0 


9.6 


i in Lronns-r 


1 R 7 
1 o. / 


OQ 7 


112756 OA 
Bone9 


49.7 


47.0 


112389 Match 
Control Crohns-F 


19.8 


15.3 


112757 OA 
Synovium9 


1.8 


3.3 


112375 Crohns-F 


17.2 


17.3 


112758 OA 
Synovial Fluid 
Cells9 


19.3 


27.5 


112732 Match 
Control Crohns-F 


37.1 


47.0 


117125 RA 
Cartilage Rep2 


34.6 


36.6 


112725 Crohns-M 


12.9 


9.3 


1 1 3492 Bone2 RA 


16,0 


1 A O 

14. z 


112387 Match 
Control Crohns-M 


13.3 ! 


18.2 


ll 3493 

Synovi\un2 RA 


3.9 


3.8 


112378 Crohns-M 


3.4 


1.3 


H3494 Syn Fluid 
Cells JKA 


6.2 


5.9 


112390 Match 
Control Crohns-M 


48.0 


59.5 


113499 Cartilage4 
RA 


5.7 


9.0 


112726 Crohns-M 


22.1 


27.9 


113500 Bone4RA ; 


10.0 


7.5 


112731 Match 
Control Crohns-M 


15.7 


24.5 


113501 

Synovium4 RA 


5.7 


3.5 


112380 Ulcer 
Col-F 


40.9 


59.5 


113502 Syn Fluid 
Cells4 RA 


4.0 


6.8 


112734 Match 
Control Ulcer 
Col-F 


80.1 


50.7 


113495 Cartilage3 
RA 


5.2 


6.0 


112384 Ulcer 
Col-F 


89.5 


100.0 


1 13496 Bone3 RA 


5.3 


5.9 


112737 Match 
Control Ulcer 
Col-F 


7.9 


15.0 


113497 

Synovium3 RA 


3,2 


5.3 


112386 Ulcer 
Col-F 


10.6 


28.5 


113498 Syn Fluid 
Cells3 RA 


8.8 


6.0 


112738 Match 


6.9 


9.7 


117106 Normal 


7.3 


12.8 
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Contro] Ulcer 
Col-F 


Ml 

1 ► 


1 1 i 


Cartilage Rep20 






112381 Ulcer 
Col-M 


.2.4 , 


1.0 


1 13663 Bone3 , 
Normal 


9.5 /, 


19.9 


112735 Match . 
Control Ulcer 
Col-M 


41.8 


17.6 


113664 

Synovium3 

Normal 


1.3 


0.0 


112382 Ulcer 
Col-M 


32.3 


27.2 


113665 Syn Fluid 
Cells3 Normal 


3.9 


1.2 


112394 Match 
Control Ulcer 
Col-M 


7.9 


1 4.2 


117107 Normal 


10.0 


7.4 : 


112383 Ulcer 
Col-M 


40.1 


54.3 


1 13667 Bone4 
Normal 


18.6 


13.2 


112736 Match 
Control Ulcer , 
Col-M 


8.4 


12.9 


113668 

Synovium4 

Noirnal 


19.1 


14.6 


112423 Psoriasis- 
F 


40.9 


35.1 


113669 Syn Fluid 
pells4 Normal 


' 29.7 


16.7 



Table 87 . GeneraLscreeningjaneiLvLS 



Tissue Name > 


Reh Exp.(%) Ag52 57 5 Run 
229827564 


Tissue Name 


Rel. Exp.(%) Ag5257, Run 


Adipose 


3.5 


Renal ca. 1K-1U 




Melanoma* Hs688(A).T 


19.5 


Bladder 1 




Melanoma* Hs688(B).T 


14.5 1 


NCI-N87 


20.0 


Melanoma* Ml 4 


6.3 


Gastric ca. KATO III 


18.9 


Melanoma* LOX1MVI 


0.9 


Colon ca. SW-948 


4.3 


Melanoma* SK-MEL-5 


16.5 


Colon ca. SW480 


5.1 


Squamous cell 
carcinoma SCC-4 


1.9 


Colon ca * (SW480 met) 
SW620 


76.8 


Testis Pool 


5.2 


Colon ca. HT29 , 


4.6 


Prostate ca.* (bone met) 
PC-3 


6.1 


Colon ca. HCT-116 


5.8 


Prostate Pool 


4.7 


Colon ca. CaCo-2 


5.3 


Placenta 


0.6 


Colon cancer tissue 


6.8 


Uterus Pool 


3.0 


Colon ca. SW1116 


1.7 


Ovarian ca. OVCAR-3 


2.5 


Colon ca. Colo-205 


10.3 


Ovarian ca. SK-OV-3 


11.5 


Colon ca. SW-48 


8.4 


Ovarian ca. OVCAR-4 


• 0.4 ; 


Colon Pool 


18.2 


Ovarian ca. OVCAR-5 


8.1 


Small Intestine Pool 


6.1 


Ovarian ca. IGROV-1 


3.3 


Stomach Pool 


5.4 


Ovarian ca. OVCAR-8 


1.5 


Bone Marrow Pool \ 


2.9 


Ovary 


6.1 


Fetal.Heart 


2.1 


Breast ca. MCF-7 


8.7 


Heart Pool 


5.1 


Breast ca. MDA-MB- 
231 


10.7 


Lymph Node Pool ! 


21.0 


Breast ca. BT 549 


5.4 


Fetal Skeletal Muscle 


2.4 


Breast ca. T47D 


1.9 


Skeletal Muscle Pool 


2.1 


Breast ca. MDA-N 


3.4 


Spleen Pool 


9.5 


Breast Pool 


17.4 


Thymus Pool 


12.4 
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Trachea 


6.0 


CNS cancer (glio/astro) 

T TOT TV/TO 


2.4 


Lung 


1.2 


CNo cancer (gito/ astro; u- 
118-MG' 


— . r 

8.7 


Fetal Lung 


15.5 


CNS cancer (neurojmet) 
SK-N-AS ' 


17.9 


Lungca.NCLN4I7 


0.3 


CNS cancer (astro) SF-539 


3.1 


Lung ca. LX-1 


100.0 


CNS cancer (astro) SNB-75 


3.3 


Liang ca. NCLH146 


6.4 


CNS cancer (glio) SNB-19 


35.4 • 


Lune ca SHP-77 


3.7 ' 


CNS cancer (glio) SF-295 


6.1 


I /imp ca A549 


6.6 


Brain (Amygdala) Pool 


1.3 


T imp ca NCLH526 


1.7 


Brain (cerebellum) 


2.9 


Lune ca NCI-H23 


1.9 


Brain (fetal) 


5.0 


Ti una ca NOLH460 


44.1 


Brain (Hippocampus) Pool 


2.6 • 


T una en POP-fi? 


3.0 


Cerebral Cortex Pool 


2.3 


Lung ca. NCI-H522 


7.5 


Rrain fSnh^tantia. niera^ 

Pool 


1.1 


Liver 


0.0 


Brain (Thalamus) Pool 


2.8 


Fetal Liver 


1.4 


Brain (whole) 


3.6 


Liver ca. HepG2 


0.4 


Spinal Cord Pool 


1.9 


Kidney Pool 


21.3 


Adrenal Gland 


1.9 


Fetal Kidney 


6.6 


Pituitary gland Pool 


L3 


Renal ca. 786-0 


^ 1.1 


Salivary Gland 


2.9 


Renal ca. A498 


2.0 


Thyroid (female) 


0.4 


Renal ca. ACHN 


4.8 


Pancreatic ca. CAPAN2 


3.8 


Renal ca. UO-31 


5.5 


Pancreas Pool 


17.6 



Table 88 . Panel 1.3D 



Tissue Name 


Rel. Exp.(%) Ag3028, Run 
167968641 


Tissue Name 


Rel. Exp.(%) Ag3028, Run 
167968641 


Liver adenocarcinoma 


2.2 


Kidney (fetal) 


6.2 


Pancreas 


3.4 


Renal ca. 786-0 


0.7 


Pancreatic ca. CAP AN 2 


1.4 


Renal ca. A498 


1.1 


Adrenal gland 


0.6 


Renal ca. RXF 393 


6.5 


Thyroid 


0.6 |Renal ca. ACHN j 


2.9 


Salivary gland 


2.2 jRenalca. UO-31 


1.0 


Pituitary gland 


0.4 |RenaIca.TK-10 


12.3 


Brain (fetal) 


3.5 (Liver 


0.5 


Brain (whole) 


2.5 (Liver (fetal) 


0.5 


Brain (amygdala) 


iLiver ca. (hepatoblast) 
iy |HepG2 


0.3 


BTain (cerebellum) 


0.6 jLung 


1.3 


Brain (hippocampus) 


1.6 jLung (fetal) 


3.5 


Brain (substantia nigra) 


0 ? jLung ca. (small cell) LX- 


47.0 


Brain (thalamus) 


, jLung ca. (small cell) 
1,0 JNCI-H69 


18.6 


Cerebral Cortex 


n Q Lung ca. (sxell var.) 
°' 8 |SHP~77 


14.1 


Spinal cord 


1 . 1 jLung ca. (large cell)NCI- 


0.7 
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H460 




glio/astro U87-MG 


" - 12 


Lung ca. (non-sm. cell)' 
A549 


8.1 


glio/astrolM18-MG 


' 4.3 


Lung ca. (non-s.cell) 
NCI-H23 


2.4 


astrocytoma SW1783 


2.5 


Lung ca. (non-s.cell) 

TT/-\T> /TO 

riVr-oZ 


4.8 


neuro*; met SK-N-AS 


7.5 


Lung ca. (non-s.cl) NCI- 


4.0 


astrocytoma SF-539 


, 2.0 


Lung ca n (squam.) SW 
oah 


4.9 , 


astrocytoma SNB-75 


3.5 


Lung ca. (squam.) NCI- 
H596 


16.0 


elioma SNB-19 


1/6 


Mammary gland 


3.1 


glioma U251 


9.4 


Breast ca.* (pl.ef) MCF- 
7 


4.2 


glioma SF-295 . 


2.2 


Breast ca.* (pl.ef) MDA- : 
Mfi-231 . 


H.J 


Heart (fetal) 


0.8 


Breast ca.* (pl.ef) T47D : 


11.1 


Heart 


0.8 


Breast ca. BT-549 i 


i.o 


Skeletal muscle (fetal) 


1.4 


Breast ca. MDA-N 1 


, 3.3 


Skeletal muscle 


0.8 


Ovary 


0.9 


Bone marrow 


0.5 ■ I 


Ovarian ca. OVCAR-3 ; 


1.6 


Thymus 


7.9 


Ovarian ca. OVCAR-4 | 


0.2 


Spleen 


3.8 ; 


Ovarian ca. OVCAR-5 


9.2 


Lymph node \ 


1 0 9 


Ovarian ra OVPAR-R 


0 4 


Colorectal 


3.9 . j . 


Ovarian ca. 1GROV-1 i 


3.3 


Stomach 


1.9 


Ovarian ca.* (ascites) 
SK-OV-3 


18.4 


.^TY^all intpctinp 
OUlali JUlColliiC 


1.2 


Uterus 


5.6 




Colon ca. SW480 


13 


Placenta 


0.1 


Colon ca.* SW620(SW480 

IJuCl/ 


100.0 


Prostate 


0.6 


Colon ca. HT29 


4.5 


Prostate ca.* (bone j 
met)PC-3 


,e 


Cnlnn ca HCT-llfi 

V-V/lvMl V/W. X X \s X X IV) 


2.0 


Testis 1 


0.9 

. 


Colon ca. CaCo-2 


1.5 


Melanoma Hs688(A).T 


4.4 


Colon ca. 


6.3 


Melanoma* (met) j 
Hs688(B).T ! 


5.0 


Colon ca. HCC-2998 


7.9 


Melanoma UACC-62 


2.5 


Gastric ca.* (liver met) 
NCI-N87 


6.2 


Melanoma Ml 4 


2.3 


Bladder 


7.9 


Melanoma LOX IMVI ! 


0.4 


Trachea 


2.1 


Melanoma* (met) SK- 
MEL- 5 


6.5 


Kidney 


1.6 


Adipose j 


6.9 


Table 89. Panel 4. ID 


Tissue Name 


Rel. Exp.(%) Ag5257> 
Run 229851532 


Tissue Name 


Rel. Exp.(%) Ag5257, 
Run 229851532 


Secondary Thl act 


80.7 


HUVEC IL-lbeta 


10.7 . 
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Secondary Th2 act 


81.8 


HUVECIFN gamma 


18.8 


Secondary Trl act 


66.4 


riUVEC TNF alpha + IFN 
gamma 


5.8 


Secondary Thl rest 


L. 1 ; , 

12.2 


HUVEC TNF alpha + IL4 


' f 7.8 


Secondary Th2 rest 


17.4 


HUVECIL-U , 


7.3 


Secondary Trl rest 


15.4 


Lung Microvascular EC none 


20.0 


Primary Thl act 


3.0 


Lung Microvascular EC 
TNFalpha + IL-l ( beta 


14.4 


Primary Th2 act 


51.8 


Microvascular Dermal EC none 


5.1 

- - a :.t, ,4*-t ■■ ■- 


Primary Trl act 


i 

34.9 


Microsvasular Dermal EC 
TNFalpha + IL-1 beta 


8.5 


Primary Thl rest» 


2.3 


Bronchial epithelium TNFalpha 
-HLlbeta ; 


0.0 


Primary Th2 rest 


11.3 


Small airway epithelium none 


0.0 


Primary Trl rest 


9.7 


Small airway epithelium 
TNFalpha + IL-1 beta 


0.0 


CD45RA CD4 lymphocyte 
act 


25.3 


Coronery artery SMC rest 1 


10.7 


CD45RO CD4 lvmohocvte 
act 


66\0 


Coronery artery SMC TNFalpha 
+ IL~lbeta 


7.2 


CDR lvmtVhocvte act 


27.2 


Astrocytes rest 


* 15.6 


Secondary CD8 

lvmnVinrvtP rest 


31.0 


Astrocytes TNFalpha + IL-1 beta 


8.5 


Secondary CDS 
lymphocyte act 


12.9 ] 


KU-812 (Basophil) rest 


58.6 


CD4 lymphocyte none 


9.9 


KU-812 (Basophil) 
PMA/ionomycin . 


70.7 


2ryThl/Th2/Trl anti- 
CD95 CH11 


y.o 


LA_/Uiiuo ^iverannocyies^ none 


u.o 


LAK cells rest 


u.y 


CCD 1 1 06 (Keratinocytes) 
TNFalpha + IL-lbeta j 


2.1 


LAK cells IL-2 


29.3 


Liver cirrhosis 


7.4 


LAK cells IL-2+1L-12 


21.2 


NCI-H292 none 


4.4 


LAK cells IL-2+1FN 
gamma 


12.2 


NCI-H292 IL-4 


10.2 


LAK cells IL-2+ IL-18 


20.2 


NC1-H292 IL-9 


5.2 


LAK cells 
PMA/ionomycin 


27.2 


NCI-H292 IL-1 3 


7.3 


NK Cells IL-2 rest 


90.8 


NCI-H292 IFN gamma 


10.0 


Two Way MLR 3 day 


25.7 


HPAEC none 


9.0 


Two Way MLR 5 day 


9.4 


HPAEC TNF alpha + IL-1 beta 


19.8 


Two Way MLR 7 day 


10.2 


Lung fibroblast none 


15.1 


PBMC rest 


7.2 


Lung fibroblast TNF alpha + IL- 

1 DCld 


11.0 


PBMC PWM 


12.4 


Lung fibroblast IL-4 


10.9 


PBMC PHA-L 


19.6 


Lung fibroblast IL-9 


14.9 


Ramos (B cell) none 


29.7 


Lung fibroblast IL-1 3 


7.1 


Ramos (B cell) ionomycin 


40.9 


Lung fibroblast IFN gamma 


28.9 


B lymphocytes PWM 


29.5 


Dermal fibroblast CCD 1070 rest. 


17.8 


B lymphocytes CD40L 
and IL-4 


68.3 


Dermal fibroblast CCD 1070 
TNF alpha 


100.0 
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3 



I " - "' ' " 

EOL-l dbcAMP 

1 


0.0 


i^ermai noroDjasi ^\sxj i u / u 
1 beta 


12.8 


lEOL-1 dbcAMP 
JPMA/ionomycin 


0.0 


Dermal fibroblast IFN gamma 


8.4 


jDendritic cells none 


33.7 


Dermal fibroblast IL-4 


19.1 


jDendritic cells LPS 


27.2 


Dermal Fibroblasts rest 


7.8 


(Dendritic cells anti-CD40 


16.5 


Neutrophils TNFa+LPS 


0.0 


|Monocytes rest 


0.0 


Neutrophils rest 


3.3 


[Monocytes LPS 


5.5 


Colon 


0.0 


[Macrophages rest 


3.1 


Lung 1 


0.0 


jMacrophages LPS 


0.0 


Thymus 


.14.1 


]HUVEC none 


,7.1 


Kidney ] 9.5 


jHUVEC starved 


8.8 


V 1 


Table 90. Panel 4D 


Tissue Name 


Re). Exp.(%) Ag3028, 
Run 164528 J 68 


Tissue Name 


Rel. Exp.(%) Ag3028, 
Run 164528168 


Secondary Thl act 


28.7 


HUVEC IL-lbeta 


3.0 


Secondary Tli2 act 


26.4 


HUVEC IFN gamma , 


8.8 


Secondary Trl act 


30.6 


gamma 


8.0 


Secondary inl rest 


IZ.O 


WT TVPP TNT« tlnha + TT 4 


6.6 


Secondary Th2 rest 


23.7 


HUVEC IL-11 


4.4 


Secondary Trl rest 


2L9 


Lung Microvascular EC none 


4.3 


Primary Thl act 


12.3 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


7.8 


Primary Th2 act 


26.4 


Microvascular Dermal EC none 


/.o 


Primary Trl act 


28.9 


iVIldUo VaaUlal JL/plIIJal J jV^ ; 

TNFalpha + IL-lbeta 


9.7 


Primary Thl rest I 


82.4 


Bronchial epithelium TNFalpha ! 
+ ILlbeta 


» 


Primary Th2 rest 


55.1 


Small airway epithelium none 


0.8 


Primary Trl rest 


49.7 


Small airway epithelium : 
TNFalpha + IL-lbeta 


2.6 


CD45RA CD4 lymphocyte 
act 


12.3 1 


Coronery artery SMC rest 


5.8 


CD45RO CD4 lymphocyte 
act 


3L0 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


2.8 


CD8 lymphocyte act 


20.6 


Astrocytes rest 


10.9 


Secondary CD8 
lymphocyte rest 


24.7 


Astrocytes TNFalpha + IL-lbeta 


6.2 


Secondary CDS 
lymphocyte act 


29.5 


KU-8 12 (Basophil) rest 


23.2 


CD4 lymphocyte none 


13.0 


KU-812 (Basophil) 
PMA/ionomycin 


55.9 


2ryThl/Th2/Trl anti- 
CD95 CH11 


32.1 


CCD1 106 (Keratinocytes) none 


2.6 


LAK cells rest 


25.5 


CCD1106 (Keratinocytes) 
TNFalpha + IL-lbeta 


1.4 


LAK cells IL-2 


33.9 


Liver cirrhosis 


1.7 


LAK cells IL-2+IL-12 


27.2 


Lupus kidney 


1-8 
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LAK cells IL-2-HFN 
gamma 


52.5 


NCI-H292 none 


3.6 


LAK cells IL-2+ IL-18 


43.8 


NCLH292 IL-4 


5.8 


LAK cells, 
PMA/ionomycin 


6.7 


NCLH292 IL-0 


4.8 


NK Cells IL-2 rest 


24.5 


NCI-H292 IL-13 


2.9 


Two Way MLR 3 day 


17.0 


NCI-H292 IFN gamma 


4.2 


Two Way MLR 5 day 


9.2 


HPAEC none 


4.9 


Two Way MLR 7 day 


24.3 


HPAEC TNF alpha + IL-i beta 


6.6 


PBMC rest 


8.9 


Lung fibroblast none 


5.3 


PBMC PWM 


62.4 

■ ••• i •-- ■ - ■ ■ 


Lung fibroblast TNF alpha + IL- 
1 beta 


'3.7 


jPBMC PHA-L 


33.9 


Lung fibroblast IL-4 


14.6 


jAamos (B cell) none 


24.3 


Lung fibroblast IL-9 


10.4 


(Ramos (B cell) ionomycin 


100.0 


Lung fibroblast IL-13 


11.4 


jB lymphocytes PWM 


97.9 


Lung fibroblast IFN gamma 


18.4 


B lymphocytes CD40L 
jand IL-4 


f 4. / 


uermaJ fibroblast ocuiu / U rest 


ion 


|eOL-1 dbcAMP 


0.0 


Dermal fibroblast CCD 1 070 * 
TNF alpha 


63.7 


EOL-l dbcAMP 
|PMA/i onomycin 


0.0 


Dermal fibroblast CCD 1070 IL- ' 
1 beta , 


6.0 


jDcndritic cells none 


1 o 1 i 

1Z. / 1 


Dermal fibroblast IFN gamma 


4.7 


jJL't/nUJlllL t/CJli> i^xij 


19.8 J 


Dermal fibroblast IL-4 


1 0 *7 


(Dendritic cells anti-CD40 


16.0 j 


IBD Colitis 2 


4.6 


[Monocytes rest 


1.6 j 


IBD Crohn's 


2.0 


(Monocytes LPS 


L4 


Colon : 


14.2 


(Macrophages rest 


6.7 j 


Lung 


6.6 


[Macrophages LPS 


3,0 | 


Thymus 


7.0 


HUVEC none 


4.3 j 


fCidney 


41.8 


[HUVEC starved 


10.2 j 





AI comprehensive paneljvLO Summary: Ag3Q28 This experiment was performed 
using a probe/primer set that recognizes both the CG56 108-01 and CG56 108-02 variants. This 
gene is expressed at low to moderate levels in most of the tissues on this panel. However, gene 



expression is up-regulated in the skin of 3/4 psoriasis patients and 4/5 colon form patients 
5 suffering from ulcerative colitis. In addition, bone and cartilage from osteoarthritis (OA) 
patients seems to have increased expression of this gene. Therefore, modulation of the 
expression or activity of this gene or its protein product by small peptides could be beneficial 
for the treatment of the condition or symptoms associated with psoriasis, OA and ulcerative 
colitis. 

10 Ag5257 The expression of this gene was assessed in an independent experiment with a 

different probe/primer set that specifically recognizes the CG561 08-02 variant. The results 
from this experiment were similar to what was observed with Ag3028. 
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General__screeningj>anel vl.5 Summary: AgS257 Expression of the CG561 08-02 
variant is highest in a sample derived from lung cancer cell line LX-1 (CT — 30). In addition, 
there is substantial expression of this gene seen in a sample derived from a, metastatic colon 
cancer cell line (SW620), as was seen using probe/primer set Ag3028. This is in contrast to the 
5 low level of expression in a genetically related cell line (SW480) that was derived from the 
primary tumor in the same patient. Thus, expression of this gene could be used to distinguish 
the sample derived from LX-1 cells from other samples in the panel, and also distinguish the 
sample derived from SW620 cells from SW480 cells. Moreover, therapeutic modulation of the 
activity of this gene or its protein product, through the use of small molecule drugs, antibodies 
1 0 or protein therapeutics, might be of benefit in the treatment of metastatic colon cancer or lung 
cancer. 

The CG561 08-02 gene encodes a variant of the TRAF5 gene. Tumor necrosis factor 

i 

(TNF) receptor-associated factors (TRAFs) are signal transducers for members of the TNP 

i • 

receptor superfamily. TRAF proteins are composed of an N-terminal cysteine/Jiistidine-rich 

15 region containing zinc RING and/or zinc finger motifs, a coiled coil (leucine zipper) motif, 
and a homologous region in the C terminus that defines the TRAF family, the TRAF domain. 
The TRAF domain is involved in self-association and receptor binding. Among tissues with 
metabolic or endocrine function, expression of this TRAF5 variant is limited to pancreas, 
indicating a potential role in insulin production and secretion. Therefore, therapeutic 

20 modulation of the activity of this gene may aid in the treatment of type II diabetes. 

Panel 1.3D Summary: Ag3028 Expression of this gene is highest in a sample derived 
from metastatic colon cancer cell line SW620 (CT = 26). Thisis in contrast to the low level of 
expression in genetically related cell line SW480 that was derived from the primary tumor in 
the same patient (CT = 33). There is also substantial expression of this gene in a number of 

25 lung cancer cell lines. Thus, the expression of this gene could be used to distinguish the 
sample derived from SW620 cells from other samples in the panel, and in particular, the 
sample derived from SW480 cells. Moreover, therepeutic modulation of the activity of this 
gene or its protein product, through the use of small molecule drugs, antibodies or protein 
therapeutics, might be of benefit in the treatment of metastatic colon cancer. 

30 This gene is also expressed at low levels throughout the central nervous system, 

including in amygdala, hippocampus, cerebellum, substantia nigra, thalamus, cerebral cortex, 
and spinal cord (CTs = 32.8-34.5). Thus, this gene may play a general role in central nervous 
system function. 
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In addition, this gene is expressed at low levels in a number of tissues with metabolic 
or endocrine function including adrenal gland, thyroid, heart, skeletal muscle and liver and at 
higher levels in pancreas and adipose. Therefore, modulation of the activity'of this gene or its 
protein product using small molecule drugs, antibodies or protein therapeutics might be of 
benefit in the treatment of metabolic diseases such as diabetes and obesity. 

Panel 4.1D Summary: Ag5257 Expression of the CG56108-02 gene was assessed in 

an independent experiment using a probe/primer set specific for this variant. The expression 

profile of this gene across panel 4. ID is generally similar to what was observed in Panel 4D 

except that the level of expression is lower. However, some slight differences in expression 

are seen: highest expression of this gene is observed in dermal fibroblasts treated with TNF-a 
i 

and IL-1, in NIC cells and the basophil cell line KU-812. Therefore, modulation of the activity 
of this gene or its protein product by small molecule drugs could blbck inflammatory 
processes associated with basophil activity and skin injury, such as those observed in allergic 
diseases, asthma, inflammatory bowel disease, and psoriasis. ' ' 

Panel 4D Summary: Ag3028 Thjs gene is expressed at low to moderate levels in the 
majority of samples on Panel 4D. However, expression of this gene is highest in activated B 
cells (Ramos cell line treated with ionomycin), activated B lymphocytes (PWM treated) and B 
cells treated with CD40L and IL-4 (CTs = 28), This gene is. also expressed at significant levels 
in activated T cells, LAK cells and kidney. 

This gene encodes for a protein that appears to be a variant of TRAF5, a signal 
transducer for the TNFR family which eventually leads to the activation of NF-kB. It has been 
reported that Traf5 -/- B lymphocytes show defects in proliferation and upregulation of various 
surface molecules, including CD23, CD54, CD80, CD86 and FAS in response to CD40 
stimulation (ref. 1). Moreover, in vitro Ig production by Traf5 -/- T lymphocytes stimulated 
with anti~CD40 plus IL4 was reduced substantially. 

Thus, modulation of the expression or activity of this gene or its protein product using 
small peptides could be beneficial for the treatment of B cell lymphoproliferative diseases or 
diseases associated with hyperglobulinemia, such as those observed in autoimmune diseases 
including systemic lupus erythematosus and rheumatoid arthritis. Furthermore, B cells 
signalling through CD40 in the presence of IL-4 can lead to immunoglobulin class switch to 
IgE and IgE can lead to severe allergic disorders. Therefore, modulation of the activity of this 
gene or its protein product using small molecule dmgs, antibodies, or protein therapeutics, 
could be beneficial for the treatment of allergic diseases. Finally, the presence of this transcript 
in activated T cells suggest that therapeutics designed against this molecule could be beneficial 
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for the treatment of T cell mediated diseases, including inflammatory bowel disease (IBD), 
psoriasis, and rheumatoid arthritis (Nakano H, Oshima H, Chung W, Williams- Abbott L, Ware 
CF, Yagita H, Okumura K. TRAF5, an activator of NF-kappaB' and putative signal transducer 
for the lymphotoxin-beta receptor. J Biol Chem 1996 Jun21;271(25):14661-4). 



M. NOV17: Ferritin light chain 

Expression of the NOV 3 7 gene (CG56101-G1) was assessed using the primer-probe set 
Ag2912, described in Table 91. Results of the RTQ-PCR run is shown in Table 92, 
Table 91 . Probe Name Ag2912 ' 



Primers 


Sequences 


_ | Start 
^ ength | Position 


Forward 


5 » -aattactgcaccgaagtggaa~3 ' (SEQ ID NO; 253) 


21 j 39 


Probe 


TET-5 ' -ctgcgggcttcccttacctacctct-3 ' -TAMRA (SEQ ID 
NO:254) 


25 j 90 


Reversed 


5 1 -cggtagaaatggaggatgaga-3 1 (SEQ ID NO:255) 


21 j 116 


Table 92. Panel 1.3D 



10 



Tissue Name 


Rel. Exp.(%) Ag2912, Run ! 
161409374 


Tissue Name 


Rel. Exp.(%) Ag2912, Run 
161409374 


Liver adenocarcinoma 


0.0 


Kidney (fetal) ! 


0.0 


Pancreas 


0.0 


Renal ca. 786-0 


0.0 


Pancreatic ca. CAP AN 2 


0.0 


Renal ca. A498 


0.0 


Adrenal gland 


0.0 


Renal ca. RXF 393 


0.0 


Thyroid 1 


0.0 


Renal ca. ACHN 


0.0 


Salivary gland 


0.0 


Renal ca. UO-31 


0.0 


Pituitary gland 


0.0 


Renal ca. TK-10 


0.0 


Brain (fetal) 


0.0 


Liver 


0.0 


Brain (whole) 


0.0 


Liver (fetal) 


0.0 


Brain (amygdala) 


0.0 


Liver ca. (hepatoblast) 
HepG2 


0.0 


Brain (cerebellum) 


0.0 


Lung 


0.0 


Brain (hippocampus) 


0.0 


Lung (fetal) 


0.0 


Brain (substantia nigra) 


0.0 


Lung ca. (small cell) LX- ! 
1 


0.0 


Brain (thalamus) 


0.0 


Lung ca. (small cell) 
NCI-H69 


0.0 


Cerebral Cortex 


0.0 ' 


Lung ca. (s.cell var.) 
SHP-77 


0.0 


Spinal cord 


0.0 


Lung ca. (large cell)NCI- 
H460 


0.0 


glio/astro U87-MG 


0.0 ! 


Lung ca. (non-snx cell) 
A549 


0.0 


glio/astro U-118-MG 


0.0 


Lung ca. (non-s.cell) 
NCLH23 


0.0 


astrocytoma SW 17 83 


0.0 


Lung ca. (non-s.cell) 
HOP-62 


0.0 


nemo*; met SK-"N-AS 


0.0 


Lung ca. (non-s.cl) NCL 


0.0 
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H522 




astrocytoma bro^y 


m " • : 
ft ft 

u.u « 


Lung ca. (squam.) SW 
900 


0.0 


astrocytoma SNB-75 


. — (— 1 — — - — - 

0.0 


Lung ca. (squam.) NCI- j 
tijyo 


0.0 


glioma SNB-19 


0.0 


Mammary gland 


O A 

u.u 


glioma U251 


0.0 


Breast ca.* (pl.ei) Mtr- 
7 


0.0 


glioma SF-295 j 


0.0 , 

1 .... . 


ftreast ca * (t>1 ef> MDA- 
MB-231 


0.0 ., 


Heart (fetal) 


• 0.0 


Breast ca* (pl.ef) T47D 


0.0 


Heart 


0.0 


Breast ca. BT-549 


0.0 


Skeletal muscle (fetal) 


Q.O 


Breast ca. MDA-N 


0.0 


Skeletal muscle 


„• 1 ■-■ =a- : - — ' 

ft 0 

u.v 


Ovary 


0.0 


Bone marrow ' 


U.u i 


Ovarian ra 

\J\ <\\ I till t^ci. v/ V \^r\l\ 


0.0 


Thymus 


u.u 


HvaniTi rn OVPAP-4 


0.0 


Spleen 


0.0 


Ovarian ca. OVCAR-5" 


0.0 


Lymph node 


0.0 


Ovarian ca. OVCAR-8 


25.3 


Colorectal 


.7:5 


Ovarian ca. IGROV-1 


0.0 


Stomach 1 


0-0 1 


Ovarian ca.*' (ascites) j 
SK-OV-3 


1 0.0 


Small intestine 


0.0 


Uterus 


0,0 


Colon ca. SW480 | 


0.0 


Placenta 


0.0 


Colon ca.* SW620(SW480j 
met) 


0.0 | 


Prostate 


0.0 


Colon ca. HT29 


0.0 


Prostate ca.* (bone 
met)PC-3 


0.0 


Colon ca. HCT-U6 


0.0 


Testis 1 


100,0 


Colon ca. CaCo-2 


0.0 


Melanoma Hs688(A).T j 


0.0 


Colon ca. 
tissue(OD03866) 


0.0 


Melanoma* (met) 


0.0 


Colon ca. HCC-2998 


0.0 


Melanoma UAdC-62 


0.0 


Gastric ca.* (liver met) 
NCI-N87 


0.0 


Melanoma M14 


0.0 


Bladder 


0.0 


Melanoma LOX IMVI 


0.0 


Trachea 


0.0 


Melanoma* (met) SK- 
MEL-5 


0.0 


Kidney 


0.0 


Adipose 


0.0 



Panel 1.3D Summary; Ag2912 Low but significant expression of the CG56101-01 



gene is limited to testis (CT = 34.8). Therefore, expression of this gene could be used to 
distinguish testis from the other samples on this panel. Furthermore, therapeutic modulation of 
this gene or its protein product may be of benefit in the treatment of infertility. 
5 Panel 2D Summary: Ag2912 Run 161410554 Expression of this gene is 

low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). Run 
162354475 Results from one experiment with the CG5610T-01 gene are not included. The 
amp plot indicates that there were experimental difficulties with this run. 
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Panel 4D Summary: Ag2912 Run 159354038 Results from one experiment with the 
CG56101-01 gene are not included. The amp plot indicates that there were experimental 
difficulties with this run. ' 

5 N. NOV18: Neurotrophfri-like gene 

Expression of the NOV1 8 gene (CG56095-01) was assessed using the primer-probe set 
ag3 67 1 , described in Table 93 . 
Table 93 . Probe Name ag3671 



primers 

: 1 ; 


Sequences 


*.vJ Start 
Length ... 

j Position 1 


Forward 


5 ' -cctctccagtgtgtcaatgg-3 » (SEQ ID NO: 256) 


2 0 | 253 


Probe 


TET-5 1 -gtcctaaccccaccctcgacattgt-3 1 -TAMRA (SEQ ID; 
NO:257) 


25 | 274 


Reverse 


5 ' -aaggtcccactttggatcag-3 1 (SEQ ID NO: 258) 


20 | 308 



General_screeningj>anel_vl.4 Summary: Ag3671 Expression of the CG56095-01 



10 gene is low/undetectable (CTs > 35) across all of the samples on this panel (data not shown). 

P. NOV19a-NOV19c: Methionyl Aminopeptidase 

Expression of the NOV19a-NOV19c genes (CG50287-02, CG50287-01, and 
CG50287-03) was assessed using the primer-probe sets Ag2541 and Ag3675, described in 
15 Tables 94-95. Results of the RTQ-PCR runs are shown in Tables 96-101. 
Table 94 . Probe Name Ag2541 



Primers 


Sequences 






jLength! 


Start 
Position 


Forward 


5 ' -cgggaaatcatcagtcataatg-3 1 (SEQ 


ID 


NO:259) 


,1 22 


383 


Probe 


TET-5 1 -tccctcacctctaggctatggaggtt- 
ID NO:260) 


-3 ' - 


-TAMRA (SEQ 


j 26 ' 


409 


Reverse 


5 1 -tgacagagcacgttgtttacag-3 ' (SEQ 


ID 


NO:261) 


i 22 


456 



Table 95 . Probe Name Ag3675 



Primers 


Sequences 


Length 


Start 
Position 


Forward; 


5 ' -cgggaaatcatcagtcataatg-3 1 (SEQ ID NO:262) 


22 


383 


Probe 


TET-5 1 -tccctcacctctaggctatggaggtt -3 1 -TAMRA (SEQ 
ID NO: 263) 


26 


409 


Reverse 


5 1 -tgacagagcacgttgtttacag-3 ♦ (SEQ ID NO: 264) 


22 


456 



Table 96 . CNSjneurodegeneration_vl.O 



Tissue Name 


Rel. Exp.(%) Ag2541, Run 
206271442 


Tissue Name 


Rel. Exp.(%) Ag2541, Run 
206271442 


AD 1 Hippo 


8.2 


Control (Path) 3 
Temporal Ctx 


2.4 


AD 2 Hippo 


36.6 


Control (Path) 4 
Temporal Ctx 


50.0 
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AD 3 Hippo 


6.0 


AD 1 Occipital Ctx 


23.2 


AD 4 Hippo 


Q ^ 


AD 2 Occipital Ctx 
(Missing) 


0.0 


AD 5 Hippo 


100.0 


AD 3 Occipital Ctx ' j 


9.3 


AD 6 Hippo 


39.2 


AD 4 Occipital Ctx 


33.0 


Control 2 Hippo 


30.8 


AD 5 Occipital Ctx 


44.4 


Control 4 Hippo 


6.8 


AD 6 Occipital Ctx 


18.0 


Control (Path) 3 Hippo 


3.7 


Control 1 Occipital Ctx 


4.4 


AD 1 Temporal Ctx 


15.4 


Control 2 Occipital Ctx 


62.4 


AD 2 Temporal Ctx 


A C f\ 

46.U 


control d uccipiiai v>ix 


ZJ.J 


AD 3 Temporal Ctx 


3.5 


Control 4 Occipital Ctx 


5.8 


AD 4 Temporal Ctx 


33.9 


Control (Path) 1 
Occipital Ctx 


88.9 


AD '5 Inf Temporal Ctx 


100.0 


Control (Path) 2 
Occipital ctx 


12.9 


AD 5 Sup Temporal 
Ctx 


41.5 ; 


Control (l am; o 
Occipital Ctx 


3.6 


AD 6 Inf Temporal Ctx 


24.8 1 


Control (ratJi) 4 
Occipital Ctx 


21.5 


AD 6 Sup Temporal 
Ctx 


42.6 1 


Control 1 Parietal Ctx ] 


12.5 . 


Control 1 Temporal Ctx 


9.0 


Control 2 Fanetal ctx 


AC\ A 
4U.0 


Control 2 Temporal Ctx 


27.4 i 


Control 3 Parietal Ctx 


16.8 


Control 3 Temporal Ctx 


22.4 | 


Control (Path) l i 
Parietal Ctx 


87.1 


Control 3 Temporal Ctx 


6.3 


Control (Path) 2 \ 
Parietal Ctx 


34.2 


Control (Path) 1 
Temporal Ctx 


47.3 


Control (Path) 3 
Parietal Ctx 


3.1 


Control (Path) 2 
Temporal Ctx 


37.4 


Control (Path) 4 
Parietal Ctx 


41.8 



Table 97 . General_screening_panel__vl,4 



Tissue Name 


Re). Exp.(%) Ag3675, Run 
218952739 


Tissue Name 


Rel. Exp.(%) Ag3675, Run 
218952739 


Adipose 


6.8 


Renal ca. TK- 10 


63.7 


Melanoma* Hs688(A).T 


12.8 


Bladder 


14.6 


Melanoma* Hs688(B).T 


17.1 ! 


Gastric ca. (liver met.) 
NCI-N87 


100.0 


Melanoma* M14 


26.4 


Gastric ca. KATO IH 


64.2 


Melanoma* LOXIMV1 


27.5 


Colon ca. SW-948 \ 


9.9 


Melanoma* SK-MEL-5 


29.1 


Colon ca. SW480 


33.7 


Squamous cell 
carcinoma SCC-4 


8.1 


Colon ca.* (SW480met) 
SW620 ; 


33.9 


Testis Pool 


9.0 


Colon ca. HT29 


20.2 


Prostate ca.* (bone met) 
PC-3 


51.8 


Colon ca. HCT-116 , 


70.7 


Prostate Pool 


7.5 


Colon ca. CaCo-2 


39.8 


Placenta 


1.9 


Colon cancer tissue 


16.3 


Uterus Pool 


4.7 


Colon ca. SW1116 


7.9 


Ovarian ca. OVCAR-3 


23.0 


Colon ca. Colo-205 , 


7.8 
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Ovarian ca. SK-OV-3 




36.9 


OX1T AG 

(Colon ca. SW-48 


9.9 


\ Ovarian ca. OVCAR-4 




-4.5. 


jColon Pool 


24.0 


Ovarian ca. OVCAR-5 




80.1 


(Small Intestine Pool 


23.3 


Ovarian ca. IGROV-1 




112.6 


|Stomach Pool 


• ' 18.3 


Ovarian ca. OVCAR-8 




9.2 


(Bone Marrow Pool 


7.9 


Ovary 




20.2 


(Fetal Heart 


23.5 


Breast ca,MCF-7 




35.4 


(Heart Pool 


6.8 


Breast ca. MDA-MB- 
231 




49.3 


IT vninli TsJndf* Pnn1 


29.3 


Breast ca. BT 549 




13.4 


' (Fetal Skeletal Muscle 


17.8 ' , 


Breast ca. T47D 




100.0 


(Skeletal Muscle Pool 


19.9 


Breast ca. MDA-N 




11.6 


|Spleen Pool 


8.2 


Breast Pool 




20.3. 


[Thymus Pool 


14.0 


Trachea 

• 




13.8 


|CNS cancer (glio/astro) 
JU87-MG 


34.9 


Lung 




i i . i 


|CNS cancer (glio/astro) U- 
|ll8-MG , 


71 ? 


Fetal Lung 




36.9 


CNS cancer (neuro;met) 
SK-N-AS 


13.1 


Lung ca. NCI-N417 




'5.8 


JCNS cancer (astro) SF T 539 


16.6 


Lungca. LX-1 , 




64.2 


jCNS cancer (astro) SNB-75. 


33,7 


Lung ca. NCLH146 




7.7 


fCNS cancer (glio) SNB- 1 9 


9.0 


Lung ca. SHP-77 




25.2 


(CNS cancer (glio) SF-295 


89.5 


Lung ca. A549 




25.5 


jBrain (Amygdala) Pool 


7.7 


Lung ca. NCI-H526 




4.2 


jBrain (cerebellum) 


6.6 


Lung ca. NCI-H23 




36.1 


jBrain (fetal) 


25.9 


Lung ca. NCI-H460 




31.2 


jBrain (Hippocampus) Pool j 


9.2 


Lung ca. HOP-62 




18.2 


(Cerebral Cortex Pool 


10.2 


Lung ca. "NCI-H522 




36.3 


Brain (Substantia nigra) 
(Pool 


7.6 


Liver 




1.7 


jBrain (Thalamus) Pool 


12.7 


Fetal Liver 




21.5 


|Brain (whole) 


9.5 


Liver ca. HepG2 




36.6 


jSpinal Cord Pool 


7.2 


Kidney Pool 




29.7 


(Adrenal Gland 


11.0 


Fetal Kidney 




48.0 


jPituitary gland Pool 


3.4 


Renal ca. 786-0 




24.3 


(Salivary Gland 


6.5 


Renai ca. A498 




5.3 


jThyroid (female) 


2.8 


Renal ca. ACHN 




25.7 


jpancreatic ca. CAPAN2 


17.8 


Renal ca. UO-31 




17.8 


(pancreas Pool 


26.1 


Table 98. Panel 1.3D 


Tissue Name 




ReL Exp.(%) Ag2541, Run 
155690258 


Liver adenocarcinoma 


9.9 


(Kidney (fetal) 


4.3 


Pancreas 


5.8 


jRenal ca. 786-0 


10.0 


Pancreatic ca. CAP AN 2 


3.4 


jRenal ca. A498 


31.6 


Adrenal gland 


6.3 


jRenal ca. RXF393 


2.1 


Thyroid 


7.1 


(Renal ca. ACHN 


6.0 


Salivary gland 


4.5 


(Renal ca. UO-31 


14.3 


Pituitary gland 


8.5 


jRenal ca. TK-10 


17.0 
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D I til Jl ^lCLo-i^ 


9.2 I 


Liver 


3.4 • 


Brain (whole) 


'8.2 ' ! 


Liver (fetal) 


• ' 6.8 


Brain (amygdala) 


. 9.1 I 


jbiver ca* ^juepaxouia&i^ 
HepG2 1 


33.7 


Brain (cerebellum) 


3.8 


Lung 




Brain (hippocampus) 


24.1 ; 


Lung (fetal) 


8.0 


Brain (substantia nigra) 


1.5 


Lung ca. (small cell) LX- 
1 


13.9 


Brain (thalamus) 


5.4 


Lung ca. (small cell) 
NCI-H69 


4.3 


Cerebral Cortex 


9.0 j 


Lung ca. (s.cell var.) 
SHP-77 


17.6 


Spinal cord 


3.8 


Lung ca. (large cell)NCI- ; 
H460 


8.8 


glio/astro U87-MG 


19.3 j 


Lung ca. (non-sm. cell) 
A549 


14.0 


glio/astroU-118-MG 


100.0 


Lung ca. (non-s.cell) 
NCI-H23 1 


19.1 


astrocytoma SW1783 


8.1 ! 
1 * 


Lung ca. (non-s.cell) 
HOP-62 


7.6 


neuro*; met SK-N-AS 


27.5 j 


Lung ca. (non-s.cl) NCI- 
H522 


' 10.7 


astrocytoma SF-539 


14.6 j 


Lung ca. (squam.) SW 
900 


5.4 

^ — ^_ 


astrocytoma SNB-75 


15.9 


Lung ca. (squam.) NUU 
H596 


1.2 


glioma oiNr>-iy 


12.8 


Mammary gland 


18.3 


glioma U251 


9.7 


Breast ca * (nl ef) MCF- 
7 


16.5 


glioma SF-295 j 


32.3 '• 


Breast ca.* (pl.ef) MDA- 
MB-231 




Heart (fetal) 


2.6 


Breast ca.* (pl.ef) T47D J 


12.9 


Heart 


3.7 


Breast ca. BT-549 


11.5 


Skeletal muscle (fetal) 


19.8 


Breast ca. MDA-N 


12.9 


Skeletal muscle 


4.3 


Ovary 


15.2 


Bone marrow 


3.7 


Ovarian ca. OVCAR-3 


14.6 


Thymus 


3.0 


Ovarian ca. OVCAR-4 


0.1 


Spleen 


8.9 


Ovarian ca. OVCAR-5 


40.1 


j_rympii nouc 


5.4 


Ovarian ca. OVCAR-8 


19.5 


Colorectal 


4.3 


Ovarian ca. IGROV-1 


4.1 


Stomach 


14,8 


Ovarian ca.* (ascites) 
SK-OV-3 


16.3 




15.1 


Uterus 


6.3 


Colon ca. SW480 


15.6 


Placenta 


4.7 


colon ca. owozu^ow^ou 
met) 


20.9 


Prostate 


2.9 


Colon ca. HT29 


12.4 


Prostate ca.* (bone 
met)PC~3 


13.3 


Colon ca.HCT~116 


12.3 


Testis 


10.2 


Colon ca. CaCo-2 


17.4 


Melanoma Hs688(A).T 


6.5 


Colon ca. 
tissue(OD03866) 


13.8 


Melanoma* (met) 
Hs688(B).T 


2.5 
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Colon ca. HCC-2998 


37.9 


Melanoma UACC-62 


2.0 


Gastric ca.* (liver met) 
NCI-N87 


68.3 


Melanoma M14 


4.2 


Bladder 


4.3 ' 


Melanoma LOX IMVI 


7.2 


Trachea 


15.8 j 


Melanoma* (met) SK- 
MEL-5 


9.2 


Kidney 


3.2 


Adipose 


3.9 



Table 99 . Panel 2D 



Tissue Name 


Rel. Exp.(%) Ag2541, 
Run 155690621 


t Tissue N*ime 


Rel. Exp.(%) Ag2541, 
Run 155690621 


Normal Colon 


100.0 


Kidney Margin 8120608 


•13.3 


CC Well to Mod Diff 
(OD03866) 


1 15.1 


Kidney Cancer 8120(513 


2.8 

— , 1 — 


CC Margin (OD03866) 


■ 10.3 


Kidney Margin 8120614 


9.9 


CC Gr.2 rectosigmoid 
(OD03868) 


15.1 


Kidney Cancer 9010320 


14.6 


CC Margin (OD03868) 


5.6 


Kidney Margin 9010321 


13.6 


CC Mod Diff (OD03 920) 


63.7 


Normal Uterus 


13.0 


CC Margin (ODO3920) 


15.7 


Uterus Cancer 06401 1 


59.0 


CC Gr.2 ascend colon 
(OD03921) 


46.3 


Normal Thyroid 


22.5 


CC Margin (OD03921) 


15.5 


Thyroid Cancfer 064010 


17.3 


CC from Partial Hepatectomy 
(ODO4309) Mets 


56.6 , 


Thyroid Cancer A302152 


26.8 


Liver Margin (ODO4309) 


36.3 


Thyroid Margin A3021 53 


23.2 


Colon mets to lung (OD04451- 
01) 


8.5 j 


Normal Breast 


37.4 


Lung Margin (OD0445 1-02) 


0.9 


Breast Cancer (OD04566) 


25.3 


Normal Prostate 6S4fv-1 


24.8 


Breast Cancer (OD04590- 
01) 


60.3 


Prostate Cancer (OD04410) 


72.2 


Breast Cancer Mets 


57.8 


Prostate Margin (OD04410) 


57.4 


Breast Cancer Metastasis 
(OD04655-05) 


87.7 


Prostate Cancer (OD04720-01) 


59.9 


Breast Cancer 064006 


12.7 


Prostate Margin (OD04720-02) 


100.0 


Breast Cancer 1024 


44.8 


Normal Lung 061010 


47.6 


Breast Cancer 9 1 00266 


22.4 


Lung Met to Muscle 
(OD04286) 


30.4 


Breast Margin 9100265 


27.2 


Muscle Margin (OD04286) 


44.4 


Breast Cancer A209073 


33.7 


Lung Malignant Cancer 
(OD03126) 


22.7 


Breast Margin A2090734 


25.0 


Lung Margin (OD03 126) 


26.1 


Normal Liver 


28.3 


Lung Cancer (OD04404) 


25.0 


Liver Cancer 064003 


16.2 


Lung Margin (OD04404) 


24.7 


Liver Cancer 1 025 


11.8 


Lung Cancer (OD04565) 


9.3 


Liver Cancer 1026 


4.0 


Lung Margin (OD04565) 


5.5 


Liver Cancer 6004-T 


14.8 


Lung Cancer (OD04237-01) 


7L2 


Liver Tissue 6004-N 


20.3 


Lung Margin (OD04237-02) 


15.8 


Liver Cancer 6005-T 


7.1 


Ocular Mel Met to Liver 


52.5 


Liver Tissue 6005-N 


3.0 
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(ODO4310) 








Liver Margin (ODO4310) 


21.2 


Normal Bladder 


42.9 


Melanoma Mets to Lung 
(OD04321) 


25.0' 


Bladder Cancer 1023 


12.5 


Lung Margin (OD04321) 


16.0 


Bladder Cancer A302 173 


15.9 


iNormai xviuncy 




Bladdei 1 Cancer 
(OD047 18-01) 


27.9 


Kidney Ca, Nuclear grade 2 
(OD04338) 


If* ^ 


Bladder Normal Adjacent 
(OD04718-03) 


27.2 


Kidney Margin (OD04338) 


48.0 


Normal Ovary 1 


13.2 


Kidney Ca Nuclear grade 1/2 
(OD04339) 


0\J,J 


Hvnn'ori Canre-r O^dOOR 
wValJan Uutv^o 


64.2 


Kidney Margin (OD04339) 


60.7 


Ovarian Cancer 
(OD0476S-07) 


66.9 

i 


Kidney Ca, Clear cell type 

(UU0434U) 


49.0 


Ovary Margin (OD04768- 

Uo y 


10.6 


Kidney Margin ^OUU434U) 




Normal Stomach 




JVlUiJcy v_^ci, n ubioai gidcic «J 

(OD04348) 


9.9 


Gastric Cancer 9060358 


4.0 


Kidney Margin (OD04348) 


19.1 


Stomach Margin 9060359 


14.1 


Kidney Cancer (OD04622-01) 


9.5 


Gastric Cancer 9060395 


25.2 


Kidney Margin (OD04622-03) 


3.7 


Stomach Margin 9060394 ; 


23.8 


Kidney Cancer (OD04450-01) 


42.0 


Gastric Cancer 9060397 


82.9 


Kidney Margin (OD04450-03) 


25.5 


Stomach Margin 9060396 


4.5 


Kidney Cancer 8120607 


18.4 


Gastric Cancer 064005 


41.5 



Table 100 . Panel 4. ID 



Tissue Name 


Rel. Exp.(%) Ag3675, 
Run 169976076 


Tissue Name 


Rel. Exp.(%) Ag3675, 
Run 169976076 


Secondary Thl act 


29.3 


HUVEC IL-lbeta 


44.1 


Secondary Th2 act 


55.1 


HUVEC 1FN gamma 


41.2 


Secondary Trl act 


50.3 


HUVEC TNF alpha + IFN 
gamma 


26.2 


Secondary Thl rest 


9.0 


HUVEC TNF alpha + IL4 


42.3 


Secondary Th2 rest 


18.8 


HUVEC IL-11 


15.8 


Secondary Trl rest 


17.2 


Lung Microvascular EC none 


58.6 


Primary Thl act 


52.5 


Lung Microvascular EC 
TNFalpha + IL-lbeta 


51.1 


Primary Th2 act 


51.8 


Microvascular Dermal EC none 


42.3 


Primary Trl act 


67.4 


Microsvasular Dermal EC 
TNFalpha+ IL-lbeta ! 


27.9 


Primary Thl rest . 


20.7 


Bronchial epithelium TNFalpha 
+ ILlbeta 


19.9 


Primary Th2 rest 


18.2 


Small airway epithelium none 


9.5 


Primary Trl rest 


48.0 


Small airway epithelium 
TNFalpha + IL-lbeta 


11.3 


CD45RA CD4 lymphocyte 
act 


61.1 


Coronery artery SMC rest 


17.6 


CD45RO CD4 lymphocyte 
act 


83.5 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


12.2 


CD 8 lymphocyte act 


65.1 


Astrocytes rest 


18.2 
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Secondary CD8' 
lymphocyte Test 


61.1 ' 


Astrocytes TNFalpha + IL r lbeta 


13.5 


Secondary CD8 
lymphocyte act 


?5.3 i 


KU-812 (Basophil) rest 


r .75.3 


CD4 lymphocyte none 


33.4 


&U-812 (Basophil) 
PMA/ionomycin' 


82.9 


2ry Thl/Th2/Trl_anti- 
L-Uyj Col 1 


12.4 


CCD1 106 (Keratrnocytes) none 


47.0 


LAK cells rest 


23.8 


CCD 1 1 06 (Keratinocytes) 
TNFa1nha'+ IL-lbeta 

x i^<x ci \yjxicx ■ x jo x k/wtc* 


' 22.4 


L,AJv cells IL.-Z 




T ivpr fiTTrin^i ^ < 


12.4 


LAis. ceils LL-z-t-lJL-Iz 


H 1 .0 


none 

iNLrriA"* nunc 


42.9 


T AV ^^11e TT 94-TTTNI 

gamma 


62.4 


NCI-H292 IL-4 


27.2 


LAXcellslL~2+IL-18 


58.6 


NCI-H292 IL-9 


44.1 


LAK cells 
PMA/ionomycin 


5.8 


NCI-H292IL-13 


68.8 ' 


NK Cells IL-2 rest 


27.4 


NCI-H292 IFN gamma ' 


47.6 


Two Way MLR 3 day 


26.2 


HPAEC none 


36.6 


Two Way MLR 5 day 


, 37.9' 


HPAEC TNF alpha + IL-1 beta ; 


43.5 


Two Wav MLR 7 dav 


18.3 


Lung fibroblast none 


56.3 


PBMC rest 


13.6 


Lung fibroblast TNF alpha + IL- 
1 beta 


20.4 


PBMC PWM 


30.1 


Lung fibroblast IL-4 


52.9 


PBMC PHA-L 


55 J 


i^ung iiDroDiasi ii^-y 


75.3 


Ramos (B cell) none 


100.0 


Lung fibroblast IL-1 3 


54.3 


Ramos (B cell) ionomycin 


74.2 


Lung fibroblast IFN gamma 


47 .6 


B lymphocytes PWM 


48.6 


Dermal fibroblast CCD 1070 rest'. 


41.8 


B lymphocytes CD40L 

^ TT A 

ana IL-4 


54.3 


Dermal fibroblast CCD 1 070 ; 

TTKTP nlnlin 

j. xv r uipjjci : 


42.0 


EOL-1 dbcAMP 


23.3 


UClllldl ilUIUDldSl I V / V JLJL<- 

1 beta ' , 


27.5 


EOL-1 dbcAMP 
PMA/ionomycin 


12.9 


Dermal fibroblast IFN gamma 


Zd.5 


Dendritic cells none 


15.9 


Dermal fibroblast IL-4 


50.3 


Dendritic cells LPS 


3.5 


Dermal Fibroblasts rest l 


42.9 


Dendritic cells anti-CD40 


10.7 


Neutrophils TNFa+LPS 


0.0 


Monocytes rest 


13.5 


Neutrophils rest 


0.3 


Monocytes LPS 


5.2 


Colon 


10.5 


Macrophages rest 


6.5 


Lung . 


6.7 


Macrophages LPS 


1.2 


Thymus 


19.3 


HUVEC none 


38.7 


Kidney 


48.3 


HUVEC starved 


: 34.6 






Table 101. Panel 4D 


Tissue Name 


Rel Exp.(%) Ag2541, I Name 
Run 155690634 j 


ReI.Exp.(%) Ag2541, 
Run 155690634 


Secondary Thl act 


11.5 jHUVECIL-lbeta 


8.! 


Secondary Th2 act 


1 0.0 JHUVEC IFN gamma 


15.4 


Secondary Trl act 


HUVEC TNT alpha + IFN 
jgamma 


7.1 
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Secondary Thl rest 


1.0 


HUVEC TNF alpha + IL4 


12.6 


Secondary Th2 rest 


2.3 


HUVECiL-11 


3.6 


Secondary Trl rest 


. . 5.4 , ] 


Lung Microvascular EC none 


17.3 


Primary Thl act 


12.6 


Lung Microvascular EC 
TNFalpha + IMbtfa ; 


8.5 


Primary Th2 act 


12.5 


Microvascular Dermal EC none 


14.4 


Primary Trl act 


21.9 


Microsvasular Dermal EC 
TNFalpha + IL-lbeta 


O.H 


Primary Thl rest 


28.3 


Bronchial epithelium TNFalpha 
+ ILlbeta 1 : 


f\ C 

V.D 


Primary Th2 rest 


13.1 


Small airway epithelium none 


3.5 


Primary Trl rest 


( 14.4 


Small airway epithelium 
TNFalpha ^IL-lbeta 


1 /.I 


CD45RA CD4 lymphocyte 
act' 


13.0 


Coronery artery SMC rest 


i 


CD45RO CD4 lymphocyte 
act 


15.8 , 


Coronery artery SMC TNFalpha 
+ IL-lbeta 


4.0 


CD8 lymphocyte act 


15.1 


Astrocytes rest 


7.4 


Secondary CD 8 
lymphocyte rest 


10.7 | 


Astrocytes TNFalpha + IL-lbeta^ 


2.8 


Secondary CD8 
lymphocyte act 


5.1 


KU-8 12 (Basophil) rest 


1 c n 


CD4 lymphocyte none 


6.9 


KU-8 12 (Basophil) 
PMA/ionomycin 


16.7 


2ryThl/Th2/Trl anti- 
OD95CH11 




ffnl 106 CKeratmocvtes^ none ! 


8.8 


TIT/' _ - 1 1 _~ 0 J 

LAK cells rest 


O.o 


CCD1 106 (Keratinocytes) 
TNFalpha + IL-1 beta 


0.5 


LAK cells IL-2 


10.8 


Liver cirrhosis 


2.1 


LAK cells IL-2+IL-12 


10.4 


Lupus kidney 


1.8 


LAK cells IL-2+IFN 
gamma 


22.5 


NCLH292 none 


30.1 


LAK cells IL-2+ IL-18 


25.7 


NCI-H292 IL-4 




LAK cells 
PMA/ionomycin 


1.3 ; 


NCI-H292 IL-9 


44.8 


NK Cells IL-2 rest 


7.0 


NCl-JtlzyxC IL-li 


I f .3 


Two Way MLR 3 day 


11.4 


NCl-Jtlzy2 IrN gamma 


Z 1 .u 


Two Way MLR 5 day 


3.4 


HPAbC none 




Two Way MLR 7 day 


3.2 


HrAbC IJSIr alpha + LL-i beta 




PBMC rest 


2.9 


Lung fibroblast none 


10.1 


PBMC PWM 


17.0 


Lung fibroblast TNF alpha + EL- 
1 beta 


4.3 


PBMC PHA-L 


9.4 


Lung fibroblast IL-4 


28.3 


Ramos (B cell) none 


28.7 


Lung fibroblast IL-9 


16.8 


Ramos (B cell) ionomycin 


100.0 


Lung fibroblast IL-13 


14.7 


B lymphocytes PWM 


64.2 


Lung fibroblast IFN gamma 


24.0 


B lymphocytes CD40L 
andIL-4 


23.7 


Dermal fibroblast CCD 1070 rest 


25.7 


EOL-1 dbcAMP 


6.6 


Dermal fibroblast CCD 1 070 
TNF alpha 


40.9 


EOL-1 dbcAMP 
PMA/ionomycin 


5.4 


Dermal fibroblast CCD 1070 IL- 
1 beta 


7.5 
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[Dendritic cells hone 


4.0 


Dermal fibroblast TPM oamma 




(Dendritic cells LPS 




Derrrial fibroblast TT -4 




jDendritic cells anti-CD40 


3.4 

— ^ — . 1 — - — 


IBD Colitis 2 


1.0 


(Monocytes rest 


4.0 


IBD Crohn's 


1.0 


[Monocytes LPS* 


0.4 


Colon 


6.0 


(Macrophages rest 


3.5 


Lung 


4.3 


[Macrophages LPS 


0.6 


Thymus 


14.4 


JHUVEC none 


19.3 


Kidney 


, 6.0 


(HUVEC starved 


24.7 

— — ■ ■ * — -■- — — 







expression of the CG50287-01 gene at moderate level in the CNS in an independent group of 
patients. However, no differential expression of this gene was found between Alzheimer's 
disease and control postmortem brains. Please see Panel 1.3D for a discussion of the potential 
utility of this gene in the central nervous system. Results from one experiment (Run 
21 1 141341) with this gene are not included because the amp plot indicates that there were 
experimental difficulties with this 1 run (data not shown). 

Generaljscreeningjpanel_vl.4 Summary: Ag3675 Expression of the CG50287-01 
gene is highest in samples derived from gastric and breast cancer cell lines (CT = 27.6). Thus, 
the expression of this gene could be used to distinguish the gastric and breast cancer cell lines 
from the other samples in the panel. In addition, there is substantial expression of this gene in 
a number of cancer-derived cell lines including brain, colon, ovarian and lung cancer. 
Therefore, therapeutic modulation of the activity of this gene or its protein product, through 
the use of small molecule drugs, antibodies or protein therapeutics, might be beneficial in the 
treatment of gastric, colon, breast, lung and ovarian cancer: . 

This panel confirms the expression of this gene at moderate levels in all central 
nervous system regions examined. Please see Panel 1.3D for a discussion of the potential 
utility of this gene in the central nervous system. 

This gene is expressed at low to moderate levels in the metabolic and endocrine tissues 
on this panel, including in adipose, pancreas, heart, skeletal muscle, liver, adrenal gland, 
pituitary gland, and thyroid. Interestingly, this gene is expressed at higher levels in fetal liver 
(CT = 29,8) than adult liver (CT = 33.4), suggesting that expression of this gene can be used to 
distinguish these two tissues. Furthermore, relative overexpression of this gene in fetal liver 
suggests that the protein product may enhance liver growth or development in the fetus and 
thus may also act in a regenerative capacity in the adult. This gene encodes a protein with 
homology to methionyl aminopeptidase, an important enzyme in post-translational regulation 
of many proteins and peptides. Therefore, therapeutic modulation of this gene or its protein 
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product may' prove useful in the treatment of endocrine and metabolic disorders, including 

i * M " " » ■ , • 

obesity and diabetes. 

Panel 1.3D Summary: ' Ag3675 Expression of the CG50287-01 gene is highest in a 
sample derived from a brain cancer cell line (CT.= 28.6). Thus, expression of this gene could 
5 be used to distinguish brain cancer cell lines from the other samples in the panel. In addition, 
there appears to be substantial expression of this gene associated with a number of cancer- 
derived cell lines including brain, colon, ovarian and breast cancer. Therefore, therapeutic? 
modulation of the activity of this gene or its protein product, through the use of small molecule 
drugs, antibodies or profein therapeutics, might be beneficial in the treatment of colon, breast, 

10 brain and ovarian cancer. 

This gene is expressed at low to moderate levels in all central nervous system regions 
examined, including amygdala, cerebellum, hippocampus, substantia nigra, cerebral cortex, 
thalamus and spinal cord (CTs = 30.6-34.6). This gene encodes a protein with homology to 
methionyl aminopeptidase (Aminopeptidase M)^ Aminopeptidase M acts as a'hypotensive 

15 agent when delivered directly into the paraventricular nucleus of rats. Therefore, therapeutic 
modulation of this gene or its protein product may be of use in the treatment of high blood 
pressure. 

This gene is expressed at low to moderate levels in the metabolic and endocrine tissues 
on this panel, including in adipose, pancreas, heart, skeletal muscle, liver, adrenal gland, 

20 pituitary gland, and thyroid. Therefore, therapeutic modulation of this gene or its protein 
product may prove useful in the treatment of endocrine and metabolic disorders, including 
obesity and diabetes (Batt CM, Jensen LL, Harding JW, Wright JW. Microinfusion of 
aminopeptidase Ivl into the paraventricular nucleus of the hypothalamus in normotensive and 
hypertensive rats. Brain Res Bull 1996;39(4);235-40). 

25 Panel 2D Summary: Ag2541 Expression of the CG50287-01 gene is highest in a 

sample derived from normal prostate tissue adjacent to a prostate cancer (CT = 29.6). Thus, 
expression of this gene could be used to distinguish normal prostate tissue from the other 
samples in the panel. In addition, there is substantial expression of this gene in a number of 
samples derived from cancer tissue in comparison to their respective controls. For instance, 

30 there appears to be substantial expression of this gene in gastric cancer, ovarian cancer, breast 
cancer, lung cancer, uterine cancer and colon cancer. Thus, expression of this gene could also 
be used to distinguish malignant tissue from normal tissue in these cases. Moreover, 
therapeutic modulation of the activity of this gene or its protein product, through the use of 
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small molecule drugs, protein therapeutics or antibodies, could be beneficial in the treatment 
of gastric, ovarian, breast, lung, uterian or colon cancers. 

Panel 4.1D Summary: Ag367'5 The CG50287-01 gerie is expressed at low to 
moderate levels in almost all of the tissues on this panel. However, expression of this gene is 
5 high in the ionomycin-activated Ramos B cell line (CT = 28) as well as in pokeweed mitogen- 
activated isolated peripheral blood B lymphocytes (CT — 29.1). Therefore, antibodies and 
small molecules that antagonize the activity of the CG50287-01 protein may be useful to treat 
the symptoms in patients with autoimmune and inflammatory diseases in which B cells act as 
antigen presenting cells, such as lupus erythematosus, asthma, emphysema, Crohn's disease, 
10 ulcerative colitis, multiple sclerosis, rheumatoid arthritis, osteoarthritis, and psoriasis. 
Panel 4D Summary: Ag2541 Please see Panel 4.1D summary. 
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OTHER EMBODIMENTS 

Although particular embodiments have been disclosed herein in detail, this has been 
done by way of example for purposes of illustration only, and is not intended to be limiting 
with respect to the scope of the appended claims, which follow. In particular, it is 
contemplated by the inventors that various substitutions, alterations, and modifications may be 
made to the invention without departing from the spirit and scope of the invention as defined 
by the claims. The choice of nucleic acid starting material, clone of interest, or library type is 
believed to be a matter of routine for a person of ordinary skill in the art with knowledge of the 
embodiments described herein. Other aspects, advantages, and modifications considered to be 
, within the scope of the following claims. 
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WHAT IS CLAIMED IS: 

1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) a mature form of an amino acid sequence selected from the group 
consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and/or 50; 

(b) a variant o'f a mature form of an amino acid sequence selected from' the 

group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 

24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and/or 50, wherein one or 

more amino acid residues in said variant differs from the amino acid 

sequence of said mature form, provided that said variant differs in no 

i 

more than 15% of the amino acid residues from the amino acid 
sequence, of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 
42, 44, 46, 48 and/or 50; and 

(d) a variant of an amino acid sequence selected from the group consisting 
of SEQ IDNOS:2, 4, 6/8, 10, 12, 14; 16, 18, 20, 22, 24, 26, 28, 30, 32, 
36, 38, 40, 42, 44, 46, 48 and/or 50 wherein one or more amino acid 
residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 15% of 
amino acid residues from said amino acid sequence. 

2 The polypeptide of claim 1, wherein said polypeptide comprises the amino acid 

sequence of a naturally-occurring allelic variant of an amino acid sequence selected 
from the group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and/or 50. 

3. The polypeptide of claim 2, wherein said allelic variant comprises an amino acid 
sequence that is the translation of a nucleic acid sequence differing by a single 
nucleotide from a nucleic acid sequence selected from the group consisting of SEQ ID 
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NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 41, 43, 
45, 47 and/or 49. 

The polypeptide of claim 1, wherein trie amino acid sequence of said variant comprises 
a conservative amino acid substitution. 

An isolated nucleic acid molecule comprising a nudeic acid sequence encoding a 
polypeptide comprising ah amino acid sequence selected from the group consisting of: 

(a) ' a mature form of an amino acid sequence selected from the group 

consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 
28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and/or 50; 

(b) a variant of a mature form of an amino acid sequence selected from the 
group consisting of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 36, 38, 40, 42, 44, 46, 48 and/or 50, wherein one or 
more amino acid residues in said variant differs from the amino acid 
sequence of said mature form, provided that said variant differs in no 
more than 15% of the amino acid residues from the amino acid 
sequence of said mature form; 

(c) an amino acid sequence selected from the group consisting of SEQ ID 
NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 38, 40, 
42, 44, 46, 48 and/or 50; 

(d) a variant of an amino acid sequence selected from the group consisting 
of SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 
36, 38, 40, 42, 44, 46, 48 and/or 50, wherein one or more amino acid 
residues in said variant differs from the amino acid sequence of said 
mature form, provided that said variant differs in no more than 15% of 
amino acid residues from said amino acid sequence; 

(e) a nucleic acid fragment encoding at least a portion of a polypeptide 
comprising an amino acid sequence chosen from the group consisting of 
SEQ ID NOS:2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 
38, 40, 42, 44, 46, 48 and/or 50, or a variant of said polypeptide, 
wherein one or more amino acid residues in said variant differs from the 
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amino acid sequence of said mature form, provided that said variant 
differs in no more than 15% of amino acid residues from said amino 
acid sequence; and 1 '• 

(f) a nucleic acid molecule comprising' the complement of (a), (b), (c), (d) 
or (e). 

The nucleic acid molecule of claim 5, wherein the nucleic acid molecule comprises the 
nucleotide sequence of a naturally- occurring allelic nucleic acid variant. 

The nucleic acid molecule of claim 5, wherein the nucleic acid molecule encodes a 
polypeptide comprising the amino acid sequence of a naturally-occurring polypeptide 
variant. 

The nucleic acid molecule of claim 5, wherein the nucleic acid molecule differs by a 
single nucleotide from a nucleic acid sequence selected from the group consisting of 
SEQ ID NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 39, 
41,43,45, 47 and/or 49. 

The nucleic acid molecule of claim 5, wherein said nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of 

(a) a nucleotide sequence selected from the group consisting of SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 
37, 39, 41, 43, 45, 47 and/or 49; 

(b) a nucleotide sequence differing by one or more nucleotides from a 
nucleotide sequence selected from the group consisting of SEQ ID 
NOS:l, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 
37, 39, 41, 43, 45, 47 and/or 49, provided that no more than 20% of the 
nucleotides differ from said nucleotide sequence; 

(c) a nucleic acid fragment of (a); and 

(d) a nucleic acid fragment of (b). 

The nucleic acid molecule of claim 5, wherein said nucleic acid molecule hybridizes 
under stringent conditions to a nucleotide sequence chosen from the group consisting 
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of SEQ ID NOSrl, 3, 5, 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 34, 35, 37, 
39, 41, 43, 45, 47 and/or 49, or a complement of said nucleotide sequence. 

1 1 . The niicleic acid molecule of claim 5, wherein the nucleic acid molecule comprises a 
nucleotide sequence selected from the group consisting of 

(a) a first nucleotide sequence comprising a coding sequence differing by 
one or more nucleotide sequences from a coding. sequence encoding 

( said amino' acid sequence, provided that no more than 20% of the 
1 nucleotides in the coding sequence in said first nucleotide sequence 

differ from said coding sequence; 

(b) an isolated second polynucleotide that is a complement of the first 
polynucleotide; and i 

(c) a nucleic acid fragment of (a) or (b). 

i ■ 

12. A vector comprising the nucleic acid molecule of claim 1 1 . 

13. The vector of claim 12, further comprising a promoter operably-linked to said nucleic 
acid molecule. 

14. A cell comprising the vector of claim 12. 

15. An antibody that immunospeciflcally-binds to the polypeptide of claim 1. 

16. The antibody of claim 15, wherein said antibody is a monoclonal antibody. 

17. The antibody of claim 15, wherein the antibody is a humanized antibody. 

18. A method for determining the presence or amount of the polypeptide of claim 1 in a 
sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with an antibody that binds immuno specifically 
to the polypeptide; and 

(c) determining the presence or amount of antibody bound to said 
polypeptide, 
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thereby determining the presence or amount of polypeptide in said sample. 

19. A method for determining the presence or amount of the nucleic acid molecule of 
claim 5 in a sample, the method comprising: 

(a) providing the sample; 

(b) contacting the sample with a probe that binds to said nucleic acid 
molecule; and , 

(c) determining the presence or amount of the probe bound to said nucleic 
acid molecule, 

thereby determining the presence or amount of the nucleic acid molecule in said 
sample. 

20. A method of identifying an agent that binds to a polypeptide of claim 1, the method 
comprising: 

(a) contacting said polypeptide with said agent; and 

(b) determining whether said agent binds to said polypeptide. 

21. A method for identifying an agent that modulates the expression or activity of the 
polypeptide of claim 1, the method comprising: 

(a) providing a cell expressing said polypeptide; 

(b) contacting the cell with said agent; and 

(c) determining whether the agent modulates expression or activity of said 
polypeptide, 

whereby an alteration in expression or activity of said peptide indicates said agent 
modulates expression or activity of said polypeptide. 

22. A method for modulating the activity of the polypeptide of claim 1, the method 
comprising contacting a cell sample expressing the polypeptide of said claim with a 
compound that binds to said polypeptide in an amount sufficient to modulate the 
activity of the polypeptide. 

23. A method of heating or preventing a NOVX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired 
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the polypeptide of claim 1 in an amount sufficient to treat or prevent said NOVX- 
associated disorder in said subject. 

24. The method of claim 23, wherein said subject is a human. 

25. A method of treating or preventing a NOVX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired 
the nucleic acid of claim 5 in an amount sufficient to treat or prevent said NOVX- 
associated disorder in said subject. 

26. 1 The method of claim 25, wherein said subject is a human. 

27. A method of treating or preventing a NOVX-associated disorder, said method 
comprising administering to a subject in which such treatment or prevention is desired 
the antibody of claim 15 in an amount sufficient to treat ctt prevent said NOVX- 
associated disorder in said subject. 

28. The method of claim 27, wherein the subject is a human. 

29. A phannaceutical composition comprising the polypeptide of claim 1 and a 
pharmaceutically-acceptable carrier. 

30. A pharmaceutical composition comprising the nucleic acid molecule of claim 5 and a 
pharmaceutically-acceptable carrier. 

31 . A pharmaceutical composition comprising the antibody of claim 1 5 and a 
pharmaceutically-acceptable carrier. 

32. A kit comprising in one or more containers, the phannaceutical composition of claim 
29. 

33. A kit comprising in one or more containers, the pharmaceutical composition of claim 
30. 
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34. A kit Comprising in one or more containers, the pharmaceutical composition of claim 
31. 

35. The use of a therapeutic in the manufacture of a medicament for treating a syndrome 
associated with a human disease, the disease selected from a NOVX-associated 
disorder, wherein said therapeutic is selected from the group consisting of a NOVX 

polypeptide, a NOVX nucleic acid,,and a NOV>[ antibody. 

i 

36. A method for screening for a modulator of activity or of latency or predisposition to a 
NOVX-associated disorder, 'said method comprising: 

, (a) administering a test compound to a test animal at increased risk for a 
NOVX-associated disorder, wherein said test animal recombinantly 
expresses the polypeptide of claim 1; 

(b) measuring the activity of said polypeptide in said test animal after 
administering the compound of step (a); 

(c) comparing the activity of said protein in said test animal with the 
activity of said polypeptide in a control animal not administered said 
polypeptide, wherein a change in the activity of said polypeptide in said 
test animal relative to said control animal indicates the test compound is 
a modulator of latency of or predisposition to a NOVX-associated 
disorder. 

37. The method of claim 36, wherein said test animal is a recombinant test animal that 
expresses a test protein transgene or expresses said transgene under the control of a 
promoter at an increased level relative to a wild-type test animal, and wherein said 
promoter is not the native gene promoter of said transgene. 
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38. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the polypeptide of claim 1 in a first mammalian subject, the method 
comprising: 

(a) measuring the level of expression of the polypeptide in a sample from 
the first mammalian subject; and 

(b) comparing the amount of said polypeptide in the sample of step (a) to 
the amount of the polypeptide present in a control sample from a second 

' mammalian subject known not to have, or not to be predisposed to, said 
disease, 

wherein an alteration in the expression level of the polypeptide in the first subject as 
compared to the control sample indicates the presence of or predisposition to said 
disease. 

i • 

39. A method for determining the presence of or predisposition to a disease associated with 
altered levels of the nucleic acid molecule of claim 5 in a first mammalian subject, the 
method comprising: 

(a) measuring the amount of the nucleic acid in a sample from the first 
mammalian subject; and 

(b) comparing the amount of said nucleic acid in the sample of step (a) to 
the amount of the nucleic acid present in a control sample from a second 
mammalian subject known not to have or not be predisposed to, the 
disease; 

wherein an alteration in the level of the nucleic acid in the first subject as compared to 
the control sample indicates the presence of or predisposition to the disease. 

40. A method of treating a pathological state in a mammal, the method comprising 
administering to the mammal a polypeptide in an amount that is sufficient to alleviate 
the pathological state, wherein the polypeptide is a polypeptide having an amino acid 
sequence at least 95% identical to a polypeptide comprising an amino acid sequence of 
at least one of SEQ ID NOS:2, 4, 6, 8 3 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 36, 
38, 40, 42, 44, 46, 48 and/or 50, or a biologically active fragment thereof 
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41 . A method of treating a pathological state in a mammal, the method comprising 

admimstering to the mammal the antibody of claim 15 in an amount sufficient to 
alleviate the pathological state. 
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